2025-05-07T19:42:38.9136320Z Current runner version: '2.323.0' 2025-05-07T19:42:38.9145285Z Runner name: 'i-0694834bf9e26cc3e' 2025-05-07T19:42:38.9146233Z Machine name: 'ip-10-0-25-57' 2025-05-07T19:42:38.9148918Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:38.9151448Z Contents: read 2025-05-07T19:42:38.9152117Z Metadata: read 2025-05-07T19:42:38.9152651Z Packages: read 2025-05-07T19:42:38.9153303Z ##[endgroup] 2025-05-07T19:42:38.9155498Z Secret source: None 2025-05-07T19:42:38.9156251Z Prepare workflow directory 2025-05-07T19:42:38.9767139Z Prepare all required actions 2025-05-07T19:42:38.9805802Z Getting action download info 2025-05-07T19:42:39.1460420Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:39.4047412Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:39.9218151Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.11, 12.6.3, gcc) 2025-05-07T19:42:40.0027806Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:40.0143364Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:40.0153939Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:40.0154922Z ##[endgroup] 2025-05-07T19:42:41.1836323Z Runner Type: linux.24xlarge 2025-05-07T19:42:41.1836868Z Instance Type: c5.24xlarge 2025-05-07T19:42:41.1837201Z AMI Name: unknown 2025-05-07T19:42:41.1868379Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:46.2362905Z ##[group]Checking docker version 2025-05-07T19:42:46.2375808Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:46.2579912Z '1.44' 2025-05-07T19:42:46.2595165Z Docker daemon API version: '1.44' 2025-05-07T19:42:46.2595704Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:46.2782457Z '1.44' 2025-05-07T19:42:46.2794371Z Docker client API version: '1.44' 2025-05-07T19:42:46.2798646Z ##[endgroup] 2025-05-07T19:42:46.2801135Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:46.2806197Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=559f53" 2025-05-07T19:42:46.2951415Z ##[command]/usr/bin/docker network prune --force --filter "label=559f53" 2025-05-07T19:42:46.3099250Z ##[endgroup] 2025-05-07T19:42:46.3099603Z ##[group]Create local container network 2025-05-07T19:42:46.3108528Z ##[command]/usr/bin/docker network create --label 559f53 github_network_d1c31e94b16749b88a8f16884e015518 2025-05-07T19:42:46.5721989Z 9178cf77f3c54f6cad2f576132aae5d983e1117453110ba4473c93ded51f5acc 2025-05-07T19:42:46.5740778Z ##[endgroup] 2025-05-07T19:42:46.5763096Z ##[group]Starting job container 2025-05-07T19:42:46.5781536Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:46.6996855Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:46.7059553Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:46.7060245Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:46.7069460Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:46.7165115Z ##[command]/usr/bin/docker create --name be310ab4d61e4fc4b18391c314211a7d_amazonlinux2023_40b1c5 --label 559f53 --workdir /__w/FBGEMM/FBGEMM --network github_network_d1c31e94b16749b88a8f16884e015518 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:46.7946045Z a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 2025-05-07T19:42:46.7970114Z ##[command]/usr/bin/docker start a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 2025-05-07T19:42:47.2770556Z a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 2025-05-07T19:42:47.2790278Z ##[command]/usr/bin/docker ps --all --filter id=a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:47.2933754Z a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 Up Less than a second 2025-05-07T19:42:47.2949927Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 2025-05-07T19:42:47.3086780Z HOME=/github/home 2025-05-07T19:42:47.3087230Z GITHUB_ACTIONS=true 2025-05-07T19:42:47.3087547Z CI=true 2025-05-07T19:42:47.3088072Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:47.3110762Z ##[endgroup] 2025-05-07T19:42:47.3120256Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:47.3122059Z ##[endgroup] 2025-05-07T19:42:47.3194777Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.3195803Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.3196671Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:47.3197210Z env: 2025-05-07T19:42:47.3197503Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:47.3197903Z BUILD_ENV: build_binary 2025-05-07T19:42:47.3198281Z BUILD_TARGET: default 2025-05-07T19:42:47.3198593Z BUILD_VARIANT: cuda 2025-05-07T19:42:47.3198927Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:47.3199249Z ##[endgroup] 2025-05-07T19:42:48.1928982Z Amazon Linux 2023 repository 66 MB/s | 37 MB 00:00 2025-05-07T19:42:54.7707105Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:42:55.3286024Z Dependencies resolved. 2025-05-07T19:42:55.3462900Z Nothing to do. 2025-05-07T19:42:55.3463538Z Complete! 2025-05-07T19:42:55.5754025Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:42:55.6382308Z Dependencies resolved. 2025-05-07T19:42:55.6609547Z ======================================================================================== 2025-05-07T19:42:55.6610392Z Package Arch Version Repository Size 2025-05-07T19:42:55.6611177Z ======================================================================================== 2025-05-07T19:42:55.6611626Z Installing: 2025-05-07T19:42:55.6612081Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:55.6612798Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:55.6613383Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:55.6614024Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:55.6614664Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:55.6615284Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:55.6615859Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:55.6616481Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6617033Z Installing dependencies: 2025-05-07T19:42:55.6617496Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:55.6618301Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:55.6619034Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6619683Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:55.6620670Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:55.6621265Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:55.6621896Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:55.6622548Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:55.6623109Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:55.6623765Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:55.6624357Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:55.6625000Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:55.6625761Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:55.6626367Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:55.6627012Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:55.6627594Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.6742556Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:55.6743536Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:55.6744258Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6744945Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:55.6745538Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:55.6746204Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.6746838Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:55.6747344Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:55.6747867Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:55.6748389Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:55.6748908Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:55.6749637Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:55.6750191Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:55.6750728Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6751335Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.6751886Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:55.6752440Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:55.6753033Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.6753748Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.6754360Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.6754944Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6755556Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.6756392Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:55.6756938Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6757526Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6758116Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6758689Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:55.6759298Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:55.6759936Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.6760533Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6761148Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.6762015Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.6762663Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.6763280Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.6763881Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.6764465Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6765033Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:55.6765602Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:55.6766167Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.6766747Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6767347Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:55.6767904Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:55.6768479Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:55.6769063Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:55.6769783Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:55.6770364Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:55.6770918Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6771508Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:55.6772105Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.6772676Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.6773214Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:55.6773746Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6774330Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:55.6774905Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:55.6775475Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6776054Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:55.6776667Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:55.6777333Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:55.6777872Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.6778408Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.6778942Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:55.6779476Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:55.6780016Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:55.6780532Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6781045Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:55.6781548Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:55.6782104Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:55.6782656Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:55.6783213Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.6783783Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:55.6784342Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:55.6785380Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.6785952Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.6786504Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:55.6787085Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:55.6787630Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:55.6788204Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:55.6788674Z Installing weak dependencies: 2025-05-07T19:42:55.6789226Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:55.6789871Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6790481Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:55.6791113Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:55.6791719Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.6792314Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:55.6792683Z 2025-05-07T19:42:55.6792803Z Transaction Summary 2025-05-07T19:42:55.6793095Z ======================================================================================== 2025-05-07T19:42:55.6793448Z Install 107 Packages 2025-05-07T19:42:55.6793606Z 2025-05-07T19:42:55.6793717Z Total download size: 38 M 2025-05-07T19:42:55.6794003Z Installed size: 151 M 2025-05-07T19:42:55.6794270Z Downloading Packages: 2025-05-07T19:42:55.9565760Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 4.1 MB/s | 82 kB 00:00 2025-05-07T19:42:55.9688393Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 24 MB/s | 786 kB 00:00 2025-05-07T19:42:55.9792025Z (3/107): elfutils-debuginfod-client-0.188-3.amz 1.9 MB/s | 41 kB 00:00 2025-05-07T19:42:55.9914074Z (4/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 38 MB/s | 539 kB 00:00 2025-05-07T19:42:55.9932507Z (5/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 4.4 MB/s | 54 kB 00:00 2025-05-07T19:42:56.0159837Z (6/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 66 MB/s | 5.3 MB 00:00 2025-05-07T19:42:56.0504510Z (7/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 49 MB/s | 2.8 MB 00:00 2025-05-07T19:42:56.0571467Z (8/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 38 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.0807672Z (9/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 54 MB/s | 4.7 MB 00:00 2025-05-07T19:42:56.0887019Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 34 MB/s | 1.0 MB 00:00 2025-05-07T19:42:56.0913345Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 4.8 MB/s | 160 kB 00:00 2025-05-07T19:42:56.1078516Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 65 MB/s | 1.6 MB 00:00 2025-05-07T19:42:56.1095101Z (13/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 4.1 MB/s | 62 kB 00:00 2025-05-07T19:42:56.1120898Z (14/107): jansson-2.14-0.amzn2023.x86_64.rpm 2.6 MB/s | 46 kB 00:00 2025-05-07T19:42:56.1165818Z (15/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 9.3 MB/s | 57 kB 00:00 2025-05-07T19:42:56.1199391Z (16/107): less-608-2.amzn2023.0.2.x86_64.rpm 17 MB/s | 168 kB 00:00 2025-05-07T19:42:56.1260427Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 55 MB/s | 756 kB 00:00 2025-05-07T19:42:56.1275488Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 2.7 MB/s | 28 kB 00:00 2025-05-07T19:42:56.1296395Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 13 MB/s | 108 kB 00:00 2025-05-07T19:42:56.1348204Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 18 MB/s | 153 kB 00:00 2025-05-07T19:42:56.1369030Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 11 MB/s | 95 kB 00:00 2025-05-07T19:42:56.1379143Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.9 MB/s | 31 kB 00:00 2025-05-07T19:42:56.1426674Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 15 MB/s | 106 kB 00:00 2025-05-07T19:42:56.1459661Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 17 MB/s | 121 kB 00:00 2025-05-07T19:42:56.1473713Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 3.0 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1537235Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 65 MB/s | 706 kB 00:00 2025-05-07T19:42:56.1548815Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.2 MB/s | 10 kB 00:00 2025-05-07T19:42:56.1618337Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 31 MB/s | 394 kB 00:00 2025-05-07T19:42:56.1672359Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 41 MB/s | 573 kB 00:00 2025-05-07T19:42:56.1705729Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 17 MB/s | 256 kB 00:00 2025-05-07T19:42:56.1751094Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 39 MB/s | 454 kB 00:00 2025-05-07T19:42:56.1835539Z (32/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 45 MB/s | 542 kB 00:00 2025-05-07T19:42:56.1890241Z (33/107): openssh-clients-8.7p1-8.amzn2023.0.14 40 MB/s | 708 kB 00:00 2025-05-07T19:42:56.1911830Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 5.8 MB/s | 93 kB 00:00 2025-05-07T19:42:56.1940336Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 4.7 MB/s | 41 kB 00:00 2025-05-07T19:42:56.1976395Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 3.9 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2037326Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 15 MB/s | 179 kB 00:00 2025-05-07T19:42:56.2060676Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 2.5 MB/s | 29 kB 00:00 2025-05-07T19:42:56.2075552Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 2.2 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2094600Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 10 MB/s | 55 kB 00:00 2025-05-07T19:42:56.2118022Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 4.6 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2145196Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.8 MB/s | 36 kB 00:00 2025-05-07T19:42:56.2158393Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.4 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2285653Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 107 MB/s | 1.7 MB 00:00 2025-05-07T19:42:56.2308719Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 972 kB/s | 15 kB 00:00 2025-05-07T19:42:56.2324460Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.6 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2348691Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 5.6 MB/s | 31 kB 00:00 2025-05-07T19:42:56.2380401Z (48/107): perl-File-Basename-2.85-477.amzn2023. 3.7 MB/s | 18 kB 00:00 2025-05-07T19:42:56.2394021Z (49/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.0 MB/s | 21 kB 00:00 2025-05-07T19:42:56.2418725Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 3.8 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2437750Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.7 MB/s | 36 kB 00:00 2025-05-07T19:42:56.2464837Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 9.5 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2477347Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 2.9 MB/s | 17 kB 00:00 2025-05-07T19:42:56.2502859Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.7 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2523584Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 10 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2542575Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.9 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2559236Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 7.8 MB/s | 42 kB 00:00 2025-05-07T19:42:56.2585942Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 9.8 MB/s | 56 kB 00:00 2025-05-07T19:42:56.2607751Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 14 MB/s | 87 kB 00:00 2025-05-07T19:42:56.2627208Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 7.2 MB/s | 42 kB 00:00 2025-05-07T19:42:56.2661295Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 30 MB/s | 218 kB 00:00 2025-05-07T19:42:56.2677604Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.3 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2702575Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.6 MB/s | 31 kB 00:00 2025-05-07T19:42:56.2717163Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.2 MB/s | 13 kB 00:00 2025-05-07T19:42:56.2749637Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 3.6 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2796258Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 43 MB/s | 392 kB 00:00 2025-05-07T19:42:56.2826834Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 9.4 MB/s | 97 kB 00:00 2025-05-07T19:42:56.2849451Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 9.8 MB/s | 85 kB 00:00 2025-05-07T19:42:56.2872094Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 2.8 MB/s | 20 kB 00:00 2025-05-07T19:42:56.2922138Z (70/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 30 MB/s | 215 kB 00:00 2025-05-07T19:42:56.2950857Z (71/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 7.0 MB/s | 84 kB 00:00 2025-05-07T19:42:56.2966750Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 4.5 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2990488Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 11 MB/s | 71 kB 00:00 2025-05-07T19:42:56.3033591Z (74/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 8.7 MB/s | 55 kB 00:00 2025-05-07T19:42:56.3060256Z (75/107): perl-SelectSaver-1.02-477.amzn2023.0. 1.4 MB/s | 12 kB 00:00 2025-05-07T19:42:56.3080282Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 11 MB/s | 96 kB 00:00 2025-05-07T19:42:56.3105356Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.2 MB/s | 15 kB 00:00 2025-05-07T19:42:56.3143477Z (78/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.1 MB/s | 22 kB 00:00 2025-05-07T19:42:56.3158416Z (79/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 6.5 MB/s | 36 kB 00:00 2025-05-07T19:42:56.3185607Z (80/107): perl-Term-ANSIColor-5.01-459.amzn2023 5.0 MB/s | 48 kB 00:00 2025-05-07T19:42:56.3200292Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 3.1 MB/s | 17 kB 00:00 2025-05-07T19:42:56.3221750Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 4.2 MB/s | 22 kB 00:00 2025-05-07T19:42:56.3239125Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 6.6 MB/s | 34 kB 00:00 2025-05-07T19:42:56.3271919Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 17 MB/s | 108 kB 00:00 2025-05-07T19:42:56.3286759Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.7 MB/s | 17 kB 00:00 2025-05-07T19:42:56.3315347Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 3.4 MB/s | 23 kB 00:00 2025-05-07T19:42:56.3335362Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.4 MB/s | 14 kB 00:00 2025-05-07T19:42:56.3356502Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 11 MB/s | 71 kB 00:00 2025-05-07T19:42:56.3387663Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.2 MB/s | 15 kB 00:00 2025-05-07T19:42:56.3420661Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 16 MB/s | 126 kB 00:00 2025-05-07T19:42:56.3559063Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 104 MB/s | 2.0 MB 00:00 2025-05-07T19:42:56.3581215Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.5 MB/s | 29 kB 00:00 2025-05-07T19:42:56.3594740Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 3.0 MB/s | 46 kB 00:00 2025-05-07T19:42:56.3616358Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.6 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3659694Z (95/107): perl-podlators-4.14-458.amzn2023.0.2. 19 MB/s | 112 kB 00:00 2025-05-07T19:42:56.3673402Z (96/107): perl-parent-0.238-458.amzn2023.0.2.no 1.9 MB/s | 14 kB 00:00 2025-05-07T19:42:56.3686142Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.7 MB/s | 12 kB 00:00 2025-05-07T19:42:56.3712373Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.7 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3835319Z (99/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64. 87 MB/s | 1.3 MB 00:00 2025-05-07T19:42:56.3906387Z (100/107): shadow-utils-4.9-12.amzn2023.0.4.x86 52 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.3919778Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 2.7 MB/s | 56 kB 00:00 2025-05-07T19:42:56.3976965Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 45 MB/s | 613 kB 00:00 2025-05-07T19:42:56.4065963Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 66 MB/s | 879 kB 00:00 2025-05-07T19:42:56.4142094Z (104/107): util-linux-core-2.37.4-1.amzn2023.0. 26 MB/s | 432 kB 00:00 2025-05-07T19:42:56.4272476Z (105/107): util-linux-2.37.4-1.amzn2023.0.4.x86 64 MB/s | 2.2 MB 00:00 2025-05-07T19:42:56.4334593Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 33 MB/s | 779 kB 00:00 2025-05-07T19:42:56.4352040Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 2.1 MB/s | 42 kB 00:00 2025-05-07T19:42:56.4369507Z -------------------------------------------------------------------------------- 2025-05-07T19:42:56.4370912Z Total 49 MB/s | 38 MB 00:00 2025-05-07T19:42:57.4867239Z Running transaction check 2025-05-07T19:42:57.5322627Z Transaction check succeeded. 2025-05-07T19:42:57.5323550Z Running transaction test 2025-05-07T19:42:57.8995339Z Transaction test succeeded. 2025-05-07T19:42:57.8996734Z Running transaction 2025-05-07T19:42:58.6119848Z Preparing : 1/1 2025-05-07T19:42:58.6264601Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:58.6498328Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:58.6697820Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:58.6743194Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:58.6810951Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:58.6902106Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:58.7175032Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:58.7230159Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:58.7282664Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:58.7789324Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:58.7856649Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:58.8180594Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:58.8240561Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:58.8304479Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:58.8370908Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:58.8415667Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:58.8549454Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:58.8604394Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:58.8665722Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:58.8737713Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:58.8800072Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:58.8849836Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:58.9276906Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:58.9352139Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:58.9495782Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:58.9918008Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:59.0087553Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:59.0904353Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:59.0904968Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.0905649Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:59.0905916Z 2025-05-07T19:42:59.1110966Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.1390478Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.1573691Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.1619544Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.2732266Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.4194034Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:59.4304958Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:59.4718960Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.4774502Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.4847336Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.4903723Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:59.4973576Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:59.5023747Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:59.5062408Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:59.5097858Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:59.5174209Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:59.5226693Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:59.5314720Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:59.5511637Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:59.5580860Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:59.5627082Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:59.5670687Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:59.5721984Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:59.5774972Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:59.5829969Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:59.5921416Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:59.5987674Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:59.6032409Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:59.6087870Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:59.6144961Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:59.6209024Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:59.6251599Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:59.6306884Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:59.6377046Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:59.6436146Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:59.6543574Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:59.6622929Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:59.6681195Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:59.6733416Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:59.6781171Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:59.6861913Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:59.6952708Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:59.7028945Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:59.7088266Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:59.7142301Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:59.7219251Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:59.7287856Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:59.7346351Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:59.7415320Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:59.7464687Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:59.7520567Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:59.7584281Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:59.7663156Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:59.7738822Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:59.7801548Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:59.7864047Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:59.7912950Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:59.7961174Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:59.8025588Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:59.8081525Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:59.8138488Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:59.8190220Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:59.8249320Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:59.8328925Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:59.8861812Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:59.9821637Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:59.9953479Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:00.0034249Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:43:00.0107076Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:43:00.0174158Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:43:00.0242859Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:43:00.0297860Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:43:00.0363606Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:00.0432338Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:43:00.0633606Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:43:00.0770699Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:43:00.0853838Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:43:00.1251504Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:43:00.2476964Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:43:00.2572282Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.2690325Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.2990644Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:43:00.3087206Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:00.3336154Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:43:00.3545938Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:00.3632840Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:00.3746235Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:43:01.1389218Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.1390248Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:43:01.1390879Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:43:01.1391554Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:43:01.1392316Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 4/107 2025-05-07T19:43:01.1392953Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:43:01.1393785Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:43:01.1394432Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:43:01.1395132Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:43:01.1396119Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:43:01.1396906Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:43:01.1397583Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:43:01.1398156Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:43:01.1398867Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:43:01.1399528Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:43:01.1400095Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:43:01.1400809Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:43:01.1401403Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:43:01.1402035Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:43:01.1402740Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:43:01.1403348Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:43:01.1403977Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:43:01.1404728Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:43:01.1405418Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:43:01.1406081Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:43:01.1406754Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:43:01.1407425Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:43:01.1408084Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:43:01.1408849Z Verifying : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:43:01.1409499Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:43:01.1410070Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:43:01.1410801Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:43:01.1411412Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:43:01.1412051Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:43:01.1412799Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:43:01.1413466Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:43:01.1414133Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:43:01.1414975Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:43:01.1415641Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:43:01.1416273Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:43:01.1416875Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:43:01.1417431Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:43:01.1418002Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:43:01.1418603Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:43:01.1419175Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:43:01.1419731Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:43:01.1420354Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:01.1420933Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:01.1421483Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:43:01.1422068Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:43:01.1422652Z Verifying : perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:43:01.1423208Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:43:01.1423784Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:43:01.1424333Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:43:01.1424921Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:01.1425487Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:43:01.1426076Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:43:01.1426646Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:43:01.1427195Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:43:01.1427758Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:43:01.1428297Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:43:01.1428882Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:43:01.1429569Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:01.1430158Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:43:01.1430726Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:01.1431275Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:43:01.1431832Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:43:01.1432368Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:43:01.1432928Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:43:01.1433497Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:43:01.1434047Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:43:01.1434615Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:43:01.1435152Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:43:01.1435712Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:43:01.1436470Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:01.1437028Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:01.1437565Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:43:01.1438090Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:43:01.1438656Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:43:01.1439209Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:43:01.1439773Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:43:01.1440346Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:43:01.1440903Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:43:01.1441455Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:43:01.1442051Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:01.1442614Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:43:01.1443176Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:43:01.1443704Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:43:01.1444252Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:43:01.1444779Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:01.1445319Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:43:01.1445836Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:43:01.1446367Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:43:01.1446921Z Verifying : perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:43:01.1447469Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:43:01.1448024Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:01.1448550Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:43:01.1449093Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:43:01.1449612Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:43:01.1450152Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:43:01.1450675Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:43:01.1451202Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:43:01.1451771Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:43:01.1452278Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:43:01.1452797Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:01.1453345Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:43:01.1453858Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:01.2505293Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.2505829Z 2025-05-07T19:43:01.2505983Z Installed: 2025-05-07T19:43:01.2506346Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:43:01.2506915Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2507462Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:43:01.2508411Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2508988Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2509644Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2510170Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2510704Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2511254Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2511781Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2512314Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2512830Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.2513498Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:43:01.2514037Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:43:01.2514546Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2515072Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2515812Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2516318Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:43:01.2516850Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2517364Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2517940Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2518464Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2518986Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2519518Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2520052Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2520537Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:43:01.2521065Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:43:01.2521599Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2522110Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2522601Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:43:01.2523115Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.2523664Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.2524168Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2524671Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2525181Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2525746Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2526407Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2526948Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2527508Z perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2528057Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2529696Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2530225Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2530782Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2531323Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2531832Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2532369Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.2532900Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.2533447Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2533982Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2534616Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2535223Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2535747Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2536287Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2536844Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2537376Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2537924Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2538441Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2538971Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.2539477Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2540012Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:43:01.2540568Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2541102Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2541646Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2542187Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:43:01.2542738Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2543275Z perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2543795Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2544344Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2544881Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2545437Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:43:01.2545973Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2546503Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2547045Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2547587Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2548130Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2548640Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2549294Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2550155Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.2550816Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2551419Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2552019Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2552649Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:43:01.2553237Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.2553812Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.2554376Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2554939Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.2555515Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2556313Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2557032Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2557568Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.2558121Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2558659Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.2559200Z perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2559791Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2560357Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2560924Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.2561494Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2562037Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.2562592Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2563104Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2563655Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.2564211Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:43:01.2564734Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2565248Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2565890Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2566440Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.2566903Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:43:01.2567209Z 2025-05-07T19:43:01.2567295Z Complete! 2025-05-07T19:43:01.3253243Z ##[group]Run actions/checkout@v4 2025-05-07T19:43:01.3253629Z with: 2025-05-07T19:43:01.3253889Z submodules: true 2025-05-07T19:43:01.3254140Z repository: pytorch/FBGEMM 2025-05-07T19:43:01.3254632Z token: *** 2025-05-07T19:43:01.3254853Z ssh-strict: true 2025-05-07T19:43:01.3255123Z ssh-user: git 2025-05-07T19:43:01.3255364Z persist-credentials: true 2025-05-07T19:43:01.3255666Z clean: true 2025-05-07T19:43:01.3255935Z sparse-checkout-cone-mode: true 2025-05-07T19:43:01.3256222Z fetch-depth: 1 2025-05-07T19:43:01.3256473Z fetch-tags: false 2025-05-07T19:43:01.3256713Z show-progress: true 2025-05-07T19:43:01.3256978Z lfs: false 2025-05-07T19:43:01.3257202Z set-safe-directory: true 2025-05-07T19:43:01.3257663Z env: 2025-05-07T19:43:01.3257893Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.3258236Z BUILD_ENV: build_binary 2025-05-07T19:43:01.3258492Z BUILD_TARGET: default 2025-05-07T19:43:01.3258762Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.3259099Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:01.3259363Z ##[endgroup] 2025-05-07T19:43:01.3303687Z ##[command]/usr/bin/docker exec a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:43:01.6164857Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:43:01.6166059Z ##[group]Getting Git version info 2025-05-07T19:43:01.6166394Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:43:01.6166886Z [command]/usr/bin/git version 2025-05-07T19:43:01.6167138Z git version 2.47.1 2025-05-07T19:43:01.6171502Z ##[endgroup] 2025-05-07T19:43:01.6193775Z Temporarily overriding HOME='/__w/_temp/8ceb4f74-770f-425a-88e9-b0bf8c25be85' before making global git config changes 2025-05-07T19:43:01.6194637Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:43:01.6198403Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:43:01.6229009Z [command]/usr/bin/git config --local --get remote.origin.url 2025-05-07T19:43:01.6249251Z https://github.com/pytorch/FBGEMM 2025-05-07T19:43:01.6267752Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:43:01.6272658Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:43:01.6288873Z HEAD 2025-05-07T19:43:01.6320468Z ##[endgroup] 2025-05-07T19:43:01.6320932Z [command]/usr/bin/git submodule status 2025-05-07T19:43:01.6671748Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:43:01.6736984Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (4a61bdd) 2025-05-07T19:43:01.6808414Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:43:01.6872582Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (3ed8d2e) 2025-05-07T19:43:01.6942509Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (f8d7d77) 2025-05-07T19:43:01.7010499Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (4200844) 2025-05-07T19:43:01.7072233Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (9cca280) 2025-05-07T19:43:01.7079216Z ##[group]Cleaning the repository 2025-05-07T19:43:01.7080232Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:43:01.8590092Z Removing build_only/ 2025-05-07T19:43:01.8590988Z Removing collect_env.py 2025-05-07T19:43:01.8591323Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:43:01.8591679Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:43:01.8592094Z Removing fbgemm_gpu/dist/ 2025-05-07T19:43:01.8592420Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:43:01.8592877Z Removing fbgemm_gpu/fbgemm_gpu_genai_nightly.egg-info/ 2025-05-07T19:43:01.8593433Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:43:01.8596441Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:01.9689661Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:01.9691269Z ##[endgroup] 2025-05-07T19:43:01.9692909Z ##[group]Disabling automatic garbage collection 2025-05-07T19:43:01.9699006Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:01.9728995Z ##[endgroup] 2025-05-07T19:43:01.9730098Z ##[group]Setting up auth 2025-05-07T19:43:01.9731422Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:01.9755819Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:02.0078803Z Entering 'external/asmjit' 2025-05-07T19:43:02.0141922Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.0209898Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.0269433Z Entering 'external/cutlass' 2025-05-07T19:43:02.0339009Z Entering 'external/googletest' 2025-05-07T19:43:02.0386654Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.0433862Z Entering 'external/json' 2025-05-07T19:43:02.0494784Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:02.0538426Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:02.0801537Z Entering 'external/asmjit' 2025-05-07T19:43:02.0870255Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.0922853Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.0971500Z Entering 'external/cutlass' 2025-05-07T19:43:02.1039232Z Entering 'external/googletest' 2025-05-07T19:43:02.1087112Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.1134768Z Entering 'external/json' 2025-05-07T19:43:02.1204576Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:02.1248195Z ##[endgroup] 2025-05-07T19:43:02.1248668Z ##[group]Fetching the repository 2025-05-07T19:43:02.1253065Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:02.3690043Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:02.3690733Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:02.3709560Z ##[endgroup] 2025-05-07T19:43:02.3710057Z ##[group]Determining the checkout info 2025-05-07T19:43:02.3710534Z ##[endgroup] 2025-05-07T19:43:02.3712414Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:02.4214178Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:02.4242018Z ##[group]Checking out the ref 2025-05-07T19:43:02.4242544Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:02.4320310Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:02.4320855Z any of your branches: 2025-05-07T19:43:02.4321236Z 2025-05-07T19:43:02.4321801Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.4322305Z 2025-05-07T19:43:02.4322568Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:02.4323245Z to do so with: 2025-05-07T19:43:02.4323383Z 2025-05-07T19:43:02.4323537Z git branch 1c9ad64 2025-05-07T19:43:02.4323749Z 2025-05-07T19:43:02.4324158Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.4325617Z ##[endgroup] 2025-05-07T19:43:02.4326086Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:43:02.4330834Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:02.4369323Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:43:02.4395086Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:02.4422713Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:02.4446920Z ##[endgroup] 2025-05-07T19:43:02.4447402Z ##[group]Fetching submodules 2025-05-07T19:43:02.4448172Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:02.4751819Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:02.4752315Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:02.4752795Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:02.4753198Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:02.4753629Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:02.4754335Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:02.4754755Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:02.4760843Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:02.5473437Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:02.8110795Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:02.9033077Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:03.6042403Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:03.6428836Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:03.6514733Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:03.7576029Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:03.7584560Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:03.7857274Z Entering 'external/asmjit' 2025-05-07T19:43:03.7877871Z Entering 'external/composable_kernel' 2025-05-07T19:43:03.7912656Z Entering 'external/cpuinfo' 2025-05-07T19:43:03.7950122Z Entering 'external/cutlass' 2025-05-07T19:43:03.7979125Z Entering 'external/googletest' 2025-05-07T19:43:03.8008913Z Entering 'external/hipify_torch' 2025-05-07T19:43:03.8036848Z Entering 'external/json' 2025-05-07T19:43:03.8074609Z ##[endgroup] 2025-05-07T19:43:03.8075058Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:03.8080356Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:03.8342578Z Entering 'external/asmjit' 2025-05-07T19:43:03.8389532Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8390572Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8425691Z Entering 'external/composable_kernel' 2025-05-07T19:43:03.8457700Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8458105Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8497534Z Entering 'external/cpuinfo' 2025-05-07T19:43:03.8540885Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8541887Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8579472Z Entering 'external/cutlass' 2025-05-07T19:43:03.8612081Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8612502Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8655157Z Entering 'external/googletest' 2025-05-07T19:43:03.8700573Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8701597Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8735756Z Entering 'external/hipify_torch' 2025-05-07T19:43:03.8769504Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8769977Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8807590Z Entering 'external/json' 2025-05-07T19:43:03.8850582Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8851596Z url.https://github.com/.insteadof 2025-05-07T19:43:03.8899780Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:03.9214775Z Entering 'external/asmjit' 2025-05-07T19:43:03.9262280Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:03.9262808Z Entering 'external/composable_kernel' 2025-05-07T19:43:03.9312148Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:03.9312718Z Entering 'external/cpuinfo' 2025-05-07T19:43:03.9362756Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:03.9363983Z Entering 'external/cutlass' 2025-05-07T19:43:03.9411519Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:03.9412995Z Entering 'external/googletest' 2025-05-07T19:43:03.9458977Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:03.9459573Z Entering 'external/hipify_torch' 2025-05-07T19:43:03.9510747Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:03.9512301Z Entering 'external/json' 2025-05-07T19:43:03.9563595Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:03.9645030Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:03.9914619Z Entering 'external/asmjit' 2025-05-07T19:43:03.9945295Z Entering 'external/composable_kernel' 2025-05-07T19:43:03.9970885Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.0000890Z Entering 'external/cutlass' 2025-05-07T19:43:04.0028207Z Entering 'external/googletest' 2025-05-07T19:43:04.0061607Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.0088065Z Entering 'external/json' 2025-05-07T19:43:04.0119854Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:04.0386543Z Entering 'external/asmjit' 2025-05-07T19:43:04.0409251Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.0441957Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.0470745Z Entering 'external/cutlass' 2025-05-07T19:43:04.0504187Z Entering 'external/googletest' 2025-05-07T19:43:04.0536674Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.0566409Z Entering 'external/json' 2025-05-07T19:43:04.0595155Z ##[endgroup] 2025-05-07T19:43:04.0628642Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:04.0649751Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:04.0821742Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:04.0822193Z . $PRELUDE; print_system_info 2025-05-07T19:43:04.0822710Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.0823059Z env: 2025-05-07T19:43:04.0823324Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.0823639Z BUILD_ENV: build_binary 2025-05-07T19:43:04.0823917Z BUILD_TARGET: default 2025-05-07T19:43:04.0824151Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.0824426Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.0824677Z ##[endgroup] 2025-05-07T19:43:04.5286141Z ################################################################################ 2025-05-07T19:43:04.5286567Z # Print System Info 2025-05-07T19:43:04.5286837Z # 2025-05-07T19:43:04.5302553Z # [2025-05-07T19:43:04.529Z] + print_system_info 2025-05-07T19:43:04.5303043Z ################################################################################ 2025-05-07T19:43:04.5303298Z 2025-05-07T19:43:04.5303732Z ################################################################################ 2025-05-07T19:43:04.5304139Z [INFO] Printing environment variables ... 2025-05-07T19:43:04.5304486Z + printenv 2025-05-07T19:43:04.5304614Z 2025-05-07T19:43:04.5318038Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:04.5319114Z BUILD_VARIANT=cuda 2025-05-07T19:43:04.5319383Z HOSTNAME=a4cdfef5f677 2025-05-07T19:43:04.5319921Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_f9455a61-17d5-4064-a9c3-0c282affe551 2025-05-07T19:43:04.5320481Z GITHUB_ACTION=__run_2 2025-05-07T19:43:04.5320735Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:04.5321021Z RUNNER_NAME=i-0694834bf9e26cc3e 2025-05-07T19:43:04.5321318Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:04.5321661Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:04.5321976Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:04.5322251Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:04.5322578Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:04.5322898Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:04.5323395Z *** 2025-05-07T19:43:04.5323619Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:04.5324137Z GITHUB_ACTIONS=true 2025-05-07T19:43:04.5324428Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:04.5325053Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:04.5325651Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:04.5325950Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:04.5326255Z RUNNER_OS=Linux 2025-05-07T19:43:04.5326494Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:04.5326779Z HOME=/github/home 2025-05-07T19:43:04.5327049Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:04.5327386Z RUNNER_ARCH=X64 2025-05-07T19:43:04.5327619Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:04.5327886Z BUILD_TARGET=default 2025-05-07T19:43:04.5328344Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_f9455a61-17d5-4064-a9c3-0c282affe551 2025-05-07T19:43:04.5329013Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_f9455a61-17d5-4064-a9c3-0c282affe551 2025-05-07T19:43:04.5329547Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:04.5329900Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:04.5330204Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:04.5330694Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_f9455a61-17d5-4064-a9c3-0c282affe551 2025-05-07T19:43:04.5331252Z BUILD_ENV=build_binary 2025-05-07T19:43:04.5331501Z GITHUB_ACTOR=q10 2025-05-07T19:43:04.5331756Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:04.5332126Z KERN_NAME_LC=linux 2025-05-07T19:43:04.5332364Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:04.5332694Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:04.5333052Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:04.5333357Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:04.5333647Z SHLVL=1 2025-05-07T19:43:04.5333872Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:04.5334124Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:04.5334643Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:04.5335085Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:04.5335533Z KERN_NAME=Linux 2025-05-07T19:43:04.5335771Z GITHUB_JOB=build_artifact 2025-05-07T19:43:04.5336083Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:04.5336422Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:04.5336696Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:04.5337065Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:04.5337430Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:04.5337857Z GITHUB_BASE_REF=main 2025-05-07T19:43:04.5338093Z CI=true 2025-05-07T19:43:04.5338366Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:04.5338797Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:04.5339118Z GITHUB_ACTION_REF= 2025-05-07T19:43:04.5339452Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:04.5339994Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_f9455a61-17d5-4064-a9c3-0c282affe551 2025-05-07T19:43:04.5340521Z MACHINE_NAME=x86_64 2025-05-07T19:43:04.5340762Z _=/usr/bin/printenv 2025-05-07T19:43:04.5340917Z 2025-05-07T19:43:04.5341072Z ################################################################################ 2025-05-07T19:43:04.5341422Z [INFO] Print ldd version ... 2025-05-07T19:43:04.5341728Z + ldd --version 2025-05-07T19:43:04.5341868Z 2025-05-07T19:43:04.5341977Z ldd (GNU libc) 2.34 2025-05-07T19:43:04.5342286Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:04.5342788Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:04.5343373Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:04.5343895Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:04.5344138Z 2025-05-07T19:43:04.5344266Z ################################################################################ 2025-05-07T19:43:04.5344631Z [INFO] Print CPU info ... 2025-05-07T19:43:04.5344892Z + nproc 2025-05-07T19:43:04.5345032Z 2025-05-07T19:43:04.5353708Z 96 2025-05-07T19:43:04.5354911Z 2025-05-07T19:43:04.5355262Z + lscpu 2025-05-07T19:43:04.5355611Z 2025-05-07T19:43:04.5616343Z Architecture: x86_64 2025-05-07T19:43:04.5616745Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:04.5617279Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5617731Z Byte Order: Little Endian 2025-05-07T19:43:04.5618138Z CPU(s): 96 2025-05-07T19:43:04.5618522Z On-line CPU(s) list: 0-95 2025-05-07T19:43:04.5618891Z Vendor ID: GenuineIntel 2025-05-07T19:43:04.5619314Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5619761Z CPU family: 6 2025-05-07T19:43:04.5620077Z Model: 85 2025-05-07T19:43:04.5620416Z Thread(s) per core: 2 2025-05-07T19:43:04.5620745Z Core(s) per socket: 24 2025-05-07T19:43:04.5621071Z Socket(s): 2 2025-05-07T19:43:04.5621399Z Stepping: 7 2025-05-07T19:43:04.5621747Z BogoMIPS: 6000.01 2025-05-07T19:43:04.5624279Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5626801Z Hypervisor vendor: KVM 2025-05-07T19:43:04.5627358Z Virtualization type: full 2025-05-07T19:43:04.5627946Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:04.5628675Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:04.5629385Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:04.5629882Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:04.5630600Z NUMA node(s): 2 2025-05-07T19:43:04.5631067Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:04.5631543Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:04.5632200Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:04.5632959Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:04.5633554Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:04.5634307Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:04.5635116Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:04.5635949Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:04.5636738Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:04.5637167Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:04.5637568Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:04.5638285Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:04.5638963Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:04.5640016Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:04.5640731Z Vulnerability Srbds: Not affected 2025-05-07T19:43:04.5641144Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:04.5641435Z 2025-05-07T19:43:04.5641540Z + cat /proc/cpuinfo 2025-05-07T19:43:04.5641819Z 2025-05-07T19:43:04.5641919Z processor : 0 2025-05-07T19:43:04.5642190Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5642491Z cpu family : 6 2025-05-07T19:43:04.5642751Z model : 85 2025-05-07T19:43:04.5643078Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5643465Z stepping : 7 2025-05-07T19:43:04.5643722Z microcode : 0x5003901 2025-05-07T19:43:04.5643966Z cpu MHz : 1205.404 2025-05-07T19:43:04.5644224Z cache size : 36608 KB 2025-05-07T19:43:04.5644479Z physical id : 0 2025-05-07T19:43:04.5644729Z siblings : 48 2025-05-07T19:43:04.5644952Z core id : 0 2025-05-07T19:43:04.5645192Z cpu cores : 24 2025-05-07T19:43:04.5645419Z apicid : 0 2025-05-07T19:43:04.5645653Z initial apicid : 0 2025-05-07T19:43:04.5645907Z fpu : yes 2025-05-07T19:43:04.5646123Z fpu_exception : yes 2025-05-07T19:43:04.5646390Z cpuid level : 13 2025-05-07T19:43:04.5646615Z wp : yes 2025-05-07T19:43:04.5649058Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5651862Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5652478Z bogomips : 6000.01 2025-05-07T19:43:04.5652735Z clflush size : 64 2025-05-07T19:43:04.5652976Z cache_alignment : 64 2025-05-07T19:43:04.5653292Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5653718Z power management: 2025-05-07T19:43:04.5653893Z 2025-05-07T19:43:04.5653995Z processor : 1 2025-05-07T19:43:04.5654258Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5654525Z cpu family : 6 2025-05-07T19:43:04.5654778Z model : 85 2025-05-07T19:43:04.5655080Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5655483Z stepping : 7 2025-05-07T19:43:04.5655718Z microcode : 0x5003901 2025-05-07T19:43:04.5655988Z cpu MHz : 1199.475 2025-05-07T19:43:04.5656219Z cache size : 36608 KB 2025-05-07T19:43:04.5656482Z physical id : 0 2025-05-07T19:43:04.5656710Z siblings : 48 2025-05-07T19:43:04.5656944Z core id : 1 2025-05-07T19:43:04.5657159Z cpu cores : 24 2025-05-07T19:43:04.5657405Z apicid : 2 2025-05-07T19:43:04.5657619Z initial apicid : 2 2025-05-07T19:43:04.5657874Z fpu : yes 2025-05-07T19:43:04.5658111Z fpu_exception : yes 2025-05-07T19:43:04.5658347Z cpuid level : 13 2025-05-07T19:43:04.5658592Z wp : yes 2025-05-07T19:43:04.5660986Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5663789Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5664434Z bogomips : 6000.01 2025-05-07T19:43:04.5664671Z clflush size : 64 2025-05-07T19:43:04.5664934Z cache_alignment : 64 2025-05-07T19:43:04.5665230Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5665605Z power management: 2025-05-07T19:43:04.5665752Z 2025-05-07T19:43:04.5665846Z processor : 2 2025-05-07T19:43:04.5666102Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5666443Z cpu family : 6 2025-05-07T19:43:04.5666659Z model : 85 2025-05-07T19:43:04.5666981Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5667352Z stepping : 7 2025-05-07T19:43:04.5667605Z microcode : 0x5003901 2025-05-07T19:43:04.5667847Z cpu MHz : 1200.018 2025-05-07T19:43:04.5668103Z cache size : 36608 KB 2025-05-07T19:43:04.5668348Z physical id : 0 2025-05-07T19:43:04.5668596Z siblings : 48 2025-05-07T19:43:04.5668820Z core id : 2 2025-05-07T19:43:04.5669063Z cpu cores : 24 2025-05-07T19:43:04.5669384Z apicid : 4 2025-05-07T19:43:04.5669634Z initial apicid : 4 2025-05-07T19:43:04.5669892Z fpu : yes 2025-05-07T19:43:04.5670171Z fpu_exception : yes 2025-05-07T19:43:04.5670433Z cpuid level : 13 2025-05-07T19:43:04.5670663Z wp : yes 2025-05-07T19:43:04.5673092Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5675886Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5676503Z bogomips : 6000.01 2025-05-07T19:43:04.5676761Z clflush size : 64 2025-05-07T19:43:04.5677000Z cache_alignment : 64 2025-05-07T19:43:04.5677320Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5677672Z power management: 2025-05-07T19:43:04.5677843Z 2025-05-07T19:43:04.5677941Z processor : 3 2025-05-07T19:43:04.5682569Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5682922Z cpu family : 6 2025-05-07T19:43:04.5683205Z model : 85 2025-05-07T19:43:04.5683509Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5683926Z stepping : 7 2025-05-07T19:43:04.5684160Z microcode : 0x5003901 2025-05-07T19:43:04.5684616Z cpu MHz : 3000.008 2025-05-07T19:43:04.5684864Z cache size : 36608 KB 2025-05-07T19:43:04.5685144Z physical id : 0 2025-05-07T19:43:04.5685377Z siblings : 48 2025-05-07T19:43:04.5685731Z core id : 3 2025-05-07T19:43:04.5685956Z cpu cores : 24 2025-05-07T19:43:04.5686212Z apicid : 6 2025-05-07T19:43:04.5686430Z initial apicid : 6 2025-05-07T19:43:04.5686693Z fpu : yes 2025-05-07T19:43:04.5686949Z fpu_exception : yes 2025-05-07T19:43:04.5687190Z cpuid level : 13 2025-05-07T19:43:04.5687435Z wp : yes 2025-05-07T19:43:04.5689818Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5692627Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5693268Z bogomips : 6000.01 2025-05-07T19:43:04.5693505Z clflush size : 64 2025-05-07T19:43:04.5693769Z cache_alignment : 64 2025-05-07T19:43:04.5694066Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5694436Z power management: 2025-05-07T19:43:04.5694578Z 2025-05-07T19:43:04.5694668Z processor : 4 2025-05-07T19:43:04.5694918Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5695207Z cpu family : 6 2025-05-07T19:43:04.5695424Z model : 85 2025-05-07T19:43:04.5695866Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5696242Z stepping : 7 2025-05-07T19:43:04.5696492Z microcode : 0x5003901 2025-05-07T19:43:04.5696736Z cpu MHz : 3000.008 2025-05-07T19:43:04.5696989Z cache size : 36608 KB 2025-05-07T19:43:04.5697229Z physical id : 0 2025-05-07T19:43:04.5697488Z siblings : 48 2025-05-07T19:43:04.5697711Z core id : 4 2025-05-07T19:43:04.5697963Z cpu cores : 24 2025-05-07T19:43:04.5698181Z apicid : 8 2025-05-07T19:43:04.5698427Z initial apicid : 8 2025-05-07T19:43:04.5698677Z fpu : yes 2025-05-07T19:43:04.5698891Z fpu_exception : yes 2025-05-07T19:43:04.5699143Z cpuid level : 13 2025-05-07T19:43:04.5699365Z wp : yes 2025-05-07T19:43:04.5701777Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5704564Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5705184Z bogomips : 6000.01 2025-05-07T19:43:04.5705446Z clflush size : 64 2025-05-07T19:43:04.5705685Z cache_alignment : 64 2025-05-07T19:43:04.5706001Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5706352Z power management: 2025-05-07T19:43:04.5706523Z 2025-05-07T19:43:04.5706618Z processor : 5 2025-05-07T19:43:04.5706875Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5707129Z cpu family : 6 2025-05-07T19:43:04.5707452Z model : 85 2025-05-07T19:43:04.5707816Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5708225Z stepping : 7 2025-05-07T19:43:04.5708457Z microcode : 0x5003901 2025-05-07T19:43:04.5708734Z cpu MHz : 3000.008 2025-05-07T19:43:04.5708966Z cache size : 36608 KB 2025-05-07T19:43:04.5709300Z physical id : 0 2025-05-07T19:43:04.5709532Z siblings : 48 2025-05-07T19:43:04.5709780Z core id : 5 2025-05-07T19:43:04.5710004Z cpu cores : 24 2025-05-07T19:43:04.5710333Z apicid : 10 2025-05-07T19:43:04.5710587Z initial apicid : 10 2025-05-07T19:43:04.5710821Z fpu : yes 2025-05-07T19:43:04.5711064Z fpu_exception : yes 2025-05-07T19:43:04.5711301Z cpuid level : 13 2025-05-07T19:43:04.5711545Z wp : yes 2025-05-07T19:43:04.5713927Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5716717Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5717353Z bogomips : 6000.01 2025-05-07T19:43:04.5717586Z clflush size : 64 2025-05-07T19:43:04.5717840Z cache_alignment : 64 2025-05-07T19:43:04.5718129Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5718492Z power management: 2025-05-07T19:43:04.5718638Z 2025-05-07T19:43:04.5718751Z processor : 6 2025-05-07T19:43:04.5718984Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5719260Z cpu family : 6 2025-05-07T19:43:04.5719484Z model : 85 2025-05-07T19:43:04.5719808Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5720180Z stepping : 7 2025-05-07T19:43:04.5720490Z microcode : 0x5003901 2025-05-07T19:43:04.5720737Z cpu MHz : 1199.514 2025-05-07T19:43:04.5720999Z cache size : 36608 KB 2025-05-07T19:43:04.5721244Z physical id : 0 2025-05-07T19:43:04.5721500Z siblings : 48 2025-05-07T19:43:04.5721817Z core id : 6 2025-05-07T19:43:04.5722044Z cpu cores : 24 2025-05-07T19:43:04.5722248Z apicid : 12 2025-05-07T19:43:04.5722479Z initial apicid : 12 2025-05-07T19:43:04.5722721Z fpu : yes 2025-05-07T19:43:04.5722932Z fpu_exception : yes 2025-05-07T19:43:04.5723173Z cpuid level : 13 2025-05-07T19:43:04.5723383Z wp : yes 2025-05-07T19:43:04.5725633Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5728223Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5728807Z bogomips : 6000.01 2025-05-07T19:43:04.5729059Z clflush size : 64 2025-05-07T19:43:04.5729288Z cache_alignment : 64 2025-05-07T19:43:04.5729585Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5729915Z power management: 2025-05-07T19:43:04.5730078Z 2025-05-07T19:43:04.5730168Z processor : 7 2025-05-07T19:43:04.5730417Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5730667Z cpu family : 6 2025-05-07T19:43:04.5730900Z model : 85 2025-05-07T19:43:04.5731178Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5731611Z stepping : 7 2025-05-07T19:43:04.5731830Z microcode : 0x5003901 2025-05-07T19:43:04.5732093Z cpu MHz : 1198.874 2025-05-07T19:43:04.5732314Z cache size : 36608 KB 2025-05-07T19:43:04.5732570Z physical id : 0 2025-05-07T19:43:04.5732786Z siblings : 48 2025-05-07T19:43:04.5733020Z core id : 7 2025-05-07T19:43:04.5733229Z cpu cores : 24 2025-05-07T19:43:04.5733463Z apicid : 14 2025-05-07T19:43:04.5733694Z initial apicid : 14 2025-05-07T19:43:04.5733913Z fpu : yes 2025-05-07T19:43:04.5734137Z fpu_exception : yes 2025-05-07T19:43:04.5734351Z cpuid level : 13 2025-05-07T19:43:04.5734581Z wp : yes 2025-05-07T19:43:04.5736784Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5739363Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5739955Z bogomips : 6000.01 2025-05-07T19:43:04.5740177Z clflush size : 64 2025-05-07T19:43:04.5740417Z cache_alignment : 64 2025-05-07T19:43:04.5740686Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5741030Z power management: 2025-05-07T19:43:04.5741163Z 2025-05-07T19:43:04.5741268Z processor : 8 2025-05-07T19:43:04.5741482Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5741742Z cpu family : 6 2025-05-07T19:43:04.5741949Z model : 85 2025-05-07T19:43:04.5742244Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5742589Z stepping : 7 2025-05-07T19:43:04.5742823Z microcode : 0x5003901 2025-05-07T19:43:04.5743063Z cpu MHz : 1200.010 2025-05-07T19:43:04.5743354Z cache size : 36608 KB 2025-05-07T19:43:04.5743576Z physical id : 0 2025-05-07T19:43:04.5743808Z siblings : 48 2025-05-07T19:43:04.5744013Z core id : 8 2025-05-07T19:43:04.5744240Z cpu cores : 24 2025-05-07T19:43:04.5744468Z apicid : 16 2025-05-07T19:43:04.5744676Z initial apicid : 16 2025-05-07T19:43:04.5744912Z fpu : yes 2025-05-07T19:43:04.5745113Z fpu_exception : yes 2025-05-07T19:43:04.5745357Z cpuid level : 13 2025-05-07T19:43:04.5745567Z wp : yes 2025-05-07T19:43:04.5747968Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5750971Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5751695Z bogomips : 6000.01 2025-05-07T19:43:04.5751940Z clflush size : 64 2025-05-07T19:43:04.5752202Z cache_alignment : 64 2025-05-07T19:43:04.5752496Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5752869Z power management: 2025-05-07T19:43:04.5753011Z 2025-05-07T19:43:04.5753121Z processor : 9 2025-05-07T19:43:04.5753439Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5753696Z cpu family : 6 2025-05-07T19:43:04.5753935Z model : 85 2025-05-07T19:43:04.5754229Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5754621Z stepping : 7 2025-05-07T19:43:04.5754864Z microcode : 0x5003901 2025-05-07T19:43:04.5755176Z cpu MHz : 1198.217 2025-05-07T19:43:04.5755438Z cache size : 36608 KB 2025-05-07T19:43:04.5755688Z physical id : 0 2025-05-07T19:43:04.5755936Z siblings : 48 2025-05-07T19:43:04.5756154Z core id : 9 2025-05-07T19:43:04.5756397Z cpu cores : 24 2025-05-07T19:43:04.5756614Z apicid : 18 2025-05-07T19:43:04.5756856Z initial apicid : 18 2025-05-07T19:43:04.5757082Z fpu : yes 2025-05-07T19:43:04.5757324Z fpu_exception : yes 2025-05-07T19:43:04.5757554Z cpuid level : 13 2025-05-07T19:43:04.5757801Z wp : yes 2025-05-07T19:43:04.5760217Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5763070Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5763692Z bogomips : 6000.01 2025-05-07T19:43:04.5763943Z clflush size : 64 2025-05-07T19:43:04.5764170Z cache_alignment : 64 2025-05-07T19:43:04.5764474Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5764815Z power management: 2025-05-07T19:43:04.5765137Z 2025-05-07T19:43:04.5765230Z processor : 10 2025-05-07T19:43:04.5765466Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5765753Z cpu family : 6 2025-05-07T19:43:04.5765973Z model : 85 2025-05-07T19:43:04.5766287Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5766685Z stepping : 7 2025-05-07T19:43:04.5766909Z microcode : 0x5003901 2025-05-07T19:43:04.5767169Z cpu MHz : 3000.008 2025-05-07T19:43:04.5767403Z cache size : 36608 KB 2025-05-07T19:43:04.5767665Z physical id : 0 2025-05-07T19:43:04.5767949Z siblings : 48 2025-05-07T19:43:04.5768182Z core id : 10 2025-05-07T19:43:04.5768408Z cpu cores : 24 2025-05-07T19:43:04.5768678Z apicid : 20 2025-05-07T19:43:04.5768907Z initial apicid : 20 2025-05-07T19:43:04.5769181Z fpu : yes 2025-05-07T19:43:04.5769402Z fpu_exception : yes 2025-05-07T19:43:04.5769667Z cpuid level : 13 2025-05-07T19:43:04.5769901Z wp : yes 2025-05-07T19:43:04.5772313Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5775129Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5775780Z bogomips : 6000.01 2025-05-07T19:43:04.5776021Z clflush size : 64 2025-05-07T19:43:04.5776286Z cache_alignment : 64 2025-05-07T19:43:04.5776591Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5776974Z power management: 2025-05-07T19:43:04.5777121Z 2025-05-07T19:43:04.5777219Z processor : 11 2025-05-07T19:43:04.5777484Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5777743Z cpu family : 6 2025-05-07T19:43:04.5778026Z model : 85 2025-05-07T19:43:04.5778319Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5778724Z stepping : 7 2025-05-07T19:43:04.5778975Z microcode : 0x5003901 2025-05-07T19:43:04.5779217Z cpu MHz : 3000.008 2025-05-07T19:43:04.5779475Z cache size : 36608 KB 2025-05-07T19:43:04.5779842Z physical id : 0 2025-05-07T19:43:04.5780092Z siblings : 48 2025-05-07T19:43:04.5780307Z core id : 11 2025-05-07T19:43:04.5780561Z cpu cores : 24 2025-05-07T19:43:04.5780774Z apicid : 22 2025-05-07T19:43:04.5781126Z initial apicid : 22 2025-05-07T19:43:04.5781350Z fpu : yes 2025-05-07T19:43:04.5781586Z fpu_exception : yes 2025-05-07T19:43:04.5781814Z cpuid level : 13 2025-05-07T19:43:04.5782057Z wp : yes 2025-05-07T19:43:04.5784872Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5787779Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5788401Z bogomips : 6000.01 2025-05-07T19:43:04.5788664Z clflush size : 64 2025-05-07T19:43:04.5788900Z cache_alignment : 64 2025-05-07T19:43:04.5789293Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5789647Z power management: 2025-05-07T19:43:04.5789819Z 2025-05-07T19:43:04.5789913Z processor : 12 2025-05-07T19:43:04.5790147Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5790427Z cpu family : 6 2025-05-07T19:43:04.5790650Z model : 85 2025-05-07T19:43:04.5790968Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5791357Z stepping : 7 2025-05-07T19:43:04.5791581Z microcode : 0x5003901 2025-05-07T19:43:04.5791842Z cpu MHz : 3000.008 2025-05-07T19:43:04.5792074Z cache size : 36608 KB 2025-05-07T19:43:04.5792340Z physical id : 0 2025-05-07T19:43:04.5792565Z siblings : 48 2025-05-07T19:43:04.5792806Z core id : 12 2025-05-07T19:43:04.5793141Z cpu cores : 24 2025-05-07T19:43:04.5793381Z apicid : 24 2025-05-07T19:43:04.5793612Z initial apicid : 24 2025-05-07T19:43:04.5793870Z fpu : yes 2025-05-07T19:43:04.5794097Z fpu_exception : yes 2025-05-07T19:43:04.5794364Z cpuid level : 13 2025-05-07T19:43:04.5794584Z wp : yes 2025-05-07T19:43:04.5797150Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5800215Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5800857Z bogomips : 6000.01 2025-05-07T19:43:04.5801095Z clflush size : 64 2025-05-07T19:43:04.5801347Z cache_alignment : 64 2025-05-07T19:43:04.5801640Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5802009Z power management: 2025-05-07T19:43:04.5802151Z 2025-05-07T19:43:04.5802244Z processor : 13 2025-05-07T19:43:04.5802494Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5802748Z cpu family : 6 2025-05-07T19:43:04.5802988Z model : 85 2025-05-07T19:43:04.5803276Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5803669Z stepping : 7 2025-05-07T19:43:04.5803910Z microcode : 0x5003901 2025-05-07T19:43:04.5804149Z cpu MHz : 3000.008 2025-05-07T19:43:04.5804400Z cache size : 36608 KB 2025-05-07T19:43:04.5804636Z physical id : 0 2025-05-07T19:43:04.5804886Z siblings : 48 2025-05-07T19:43:04.5805200Z core id : 13 2025-05-07T19:43:04.5805442Z cpu cores : 24 2025-05-07T19:43:04.5805656Z apicid : 26 2025-05-07T19:43:04.5805907Z initial apicid : 26 2025-05-07T19:43:04.5806135Z fpu : yes 2025-05-07T19:43:04.5806378Z fpu_exception : yes 2025-05-07T19:43:04.5806609Z cpuid level : 13 2025-05-07T19:43:04.5806850Z wp : yes 2025-05-07T19:43:04.5809260Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5812154Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5812765Z bogomips : 6000.01 2025-05-07T19:43:04.5813016Z clflush size : 64 2025-05-07T19:43:04.5813251Z cache_alignment : 64 2025-05-07T19:43:04.5813620Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5814125Z power management: 2025-05-07T19:43:04.5814299Z 2025-05-07T19:43:04.5814401Z processor : 14 2025-05-07T19:43:04.5814794Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5815072Z cpu family : 6 2025-05-07T19:43:04.5815290Z model : 85 2025-05-07T19:43:04.5815732Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5816133Z stepping : 7 2025-05-07T19:43:04.5816358Z microcode : 0x5003901 2025-05-07T19:43:04.5816619Z cpu MHz : 3000.008 2025-05-07T19:43:04.5816860Z cache size : 36608 KB 2025-05-07T19:43:04.5817137Z physical id : 0 2025-05-07T19:43:04.5817367Z siblings : 48 2025-05-07T19:43:04.5817617Z core id : 14 2025-05-07T19:43:04.5817836Z cpu cores : 24 2025-05-07T19:43:04.5818095Z apicid : 28 2025-05-07T19:43:04.5818321Z initial apicid : 28 2025-05-07T19:43:04.5818652Z fpu : yes 2025-05-07T19:43:04.5818877Z fpu_exception : yes 2025-05-07T19:43:04.5819141Z cpuid level : 13 2025-05-07T19:43:04.5819373Z wp : yes 2025-05-07T19:43:04.5821798Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5824597Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5825251Z bogomips : 6000.01 2025-05-07T19:43:04.5825500Z clflush size : 64 2025-05-07T19:43:04.5825765Z cache_alignment : 64 2025-05-07T19:43:04.5826065Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5826447Z power management: 2025-05-07T19:43:04.5826593Z 2025-05-07T19:43:04.5826694Z processor : 15 2025-05-07T19:43:04.5826952Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5827211Z cpu family : 6 2025-05-07T19:43:04.5827459Z model : 85 2025-05-07T19:43:04.5827747Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5828214Z stepping : 7 2025-05-07T19:43:04.5828460Z microcode : 0x5003901 2025-05-07T19:43:04.5828759Z cpu MHz : 1208.949 2025-05-07T19:43:04.5829061Z cache size : 36608 KB 2025-05-07T19:43:04.5829426Z physical id : 0 2025-05-07T19:43:04.5829684Z siblings : 48 2025-05-07T19:43:04.5829898Z core id : 15 2025-05-07T19:43:04.5830138Z cpu cores : 24 2025-05-07T19:43:04.5830386Z apicid : 30 2025-05-07T19:43:04.5830804Z initial apicid : 30 2025-05-07T19:43:04.5831040Z fpu : yes 2025-05-07T19:43:04.5831279Z fpu_exception : yes 2025-05-07T19:43:04.5831513Z cpuid level : 13 2025-05-07T19:43:04.5831761Z wp : yes 2025-05-07T19:43:04.5834170Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5836965Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5837596Z bogomips : 6000.01 2025-05-07T19:43:04.5837858Z clflush size : 64 2025-05-07T19:43:04.5838099Z cache_alignment : 64 2025-05-07T19:43:04.5838412Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5838753Z power management: 2025-05-07T19:43:04.5838917Z 2025-05-07T19:43:04.5839009Z processor : 16 2025-05-07T19:43:04.5839246Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5839523Z cpu family : 6 2025-05-07T19:43:04.5839742Z model : 85 2025-05-07T19:43:04.5840057Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5840451Z stepping : 7 2025-05-07T19:43:04.5840673Z microcode : 0x5003901 2025-05-07T19:43:04.5840942Z cpu MHz : 3000.008 2025-05-07T19:43:04.5841175Z cache size : 36608 KB 2025-05-07T19:43:04.5841435Z physical id : 0 2025-05-07T19:43:04.5841663Z siblings : 48 2025-05-07T19:43:04.5841898Z core id : 16 2025-05-07T19:43:04.5842121Z cpu cores : 24 2025-05-07T19:43:04.5842358Z apicid : 32 2025-05-07T19:43:04.5842584Z initial apicid : 32 2025-05-07T19:43:04.5842836Z fpu : yes 2025-05-07T19:43:04.5843052Z fpu_exception : yes 2025-05-07T19:43:04.5843366Z cpuid level : 13 2025-05-07T19:43:04.5843614Z wp : yes 2025-05-07T19:43:04.5846003Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5848779Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5849418Z bogomips : 6000.01 2025-05-07T19:43:04.5849646Z clflush size : 64 2025-05-07T19:43:04.5849902Z cache_alignment : 64 2025-05-07T19:43:04.5850194Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5850561Z power management: 2025-05-07T19:43:04.5850704Z 2025-05-07T19:43:04.5850796Z processor : 17 2025-05-07T19:43:04.5851054Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5851307Z cpu family : 6 2025-05-07T19:43:04.5851545Z model : 85 2025-05-07T19:43:04.5851859Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5852241Z stepping : 7 2025-05-07T19:43:04.5852486Z microcode : 0x5003901 2025-05-07T19:43:04.5852728Z cpu MHz : 1200.725 2025-05-07T19:43:04.5853022Z cache size : 36608 KB 2025-05-07T19:43:04.5853262Z physical id : 0 2025-05-07T19:43:04.5853506Z siblings : 48 2025-05-07T19:43:04.5853719Z core id : 17 2025-05-07T19:43:04.5853959Z cpu cores : 24 2025-05-07T19:43:04.5854175Z apicid : 34 2025-05-07T19:43:04.5854415Z initial apicid : 34 2025-05-07T19:43:04.5854642Z fpu : yes 2025-05-07T19:43:04.5854935Z fpu_exception : yes 2025-05-07T19:43:04.5855175Z cpuid level : 13 2025-05-07T19:43:04.5855431Z wp : yes 2025-05-07T19:43:04.5858055Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5860904Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5861529Z bogomips : 6000.01 2025-05-07T19:43:04.5861786Z clflush size : 64 2025-05-07T19:43:04.5862026Z cache_alignment : 64 2025-05-07T19:43:04.5862342Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5862694Z power management: 2025-05-07T19:43:04.5862863Z 2025-05-07T19:43:04.5862956Z processor : 18 2025-05-07T19:43:04.5863197Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5863471Z cpu family : 6 2025-05-07T19:43:04.5863689Z model : 85 2025-05-07T19:43:04.5864005Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5864404Z stepping : 7 2025-05-07T19:43:04.5864629Z microcode : 0x5003901 2025-05-07T19:43:04.5864899Z cpu MHz : 1199.766 2025-05-07T19:43:04.5865138Z cache size : 36608 KB 2025-05-07T19:43:04.5865410Z physical id : 0 2025-05-07T19:43:04.5865632Z siblings : 48 2025-05-07T19:43:04.5865884Z core id : 18 2025-05-07T19:43:04.5866103Z cpu cores : 24 2025-05-07T19:43:04.5866355Z apicid : 36 2025-05-07T19:43:04.5866579Z initial apicid : 36 2025-05-07T19:43:04.5866847Z fpu : yes 2025-05-07T19:43:04.5890713Z fpu_exception : yes 2025-05-07T19:43:04.5891028Z cpuid level : 13 2025-05-07T19:43:04.5891287Z wp : yes 2025-05-07T19:43:04.5893706Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5896657Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5897270Z bogomips : 6000.01 2025-05-07T19:43:04.5897635Z clflush size : 64 2025-05-07T19:43:04.5897852Z cache_alignment : 64 2025-05-07T19:43:04.5898141Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5898458Z power management: 2025-05-07T19:43:04.5898610Z 2025-05-07T19:43:04.5898696Z processor : 19 2025-05-07T19:43:04.5898915Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5899176Z cpu family : 6 2025-05-07T19:43:04.5899383Z model : 85 2025-05-07T19:43:04.5899687Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5900064Z stepping : 7 2025-05-07T19:43:04.5900278Z microcode : 0x5003901 2025-05-07T19:43:04.5900534Z cpu MHz : 3000.008 2025-05-07T19:43:04.5900756Z cache size : 36608 KB 2025-05-07T19:43:04.5901003Z physical id : 0 2025-05-07T19:43:04.5901197Z siblings : 48 2025-05-07T19:43:04.5901420Z core id : 19 2025-05-07T19:43:04.5901630Z cpu cores : 24 2025-05-07T19:43:04.5901862Z apicid : 38 2025-05-07T19:43:04.5902070Z initial apicid : 38 2025-05-07T19:43:04.5902307Z fpu : yes 2025-05-07T19:43:04.5902515Z fpu_exception : yes 2025-05-07T19:43:04.5902756Z cpuid level : 13 2025-05-07T19:43:04.5902977Z wp : yes 2025-05-07T19:43:04.5905309Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5907912Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5908516Z bogomips : 6000.01 2025-05-07T19:43:04.5908742Z clflush size : 64 2025-05-07T19:43:04.5908991Z cache_alignment : 64 2025-05-07T19:43:04.5909333Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5909861Z power management: 2025-05-07T19:43:04.5910006Z 2025-05-07T19:43:04.5910104Z processor : 20 2025-05-07T19:43:04.5910456Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5910706Z cpu family : 6 2025-05-07T19:43:04.5910950Z model : 85 2025-05-07T19:43:04.5911243Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5911641Z stepping : 7 2025-05-07T19:43:04.5911887Z microcode : 0x5003901 2025-05-07T19:43:04.5912125Z cpu MHz : 3000.008 2025-05-07T19:43:04.5912377Z cache size : 36608 KB 2025-05-07T19:43:04.5912615Z physical id : 0 2025-05-07T19:43:04.5912844Z siblings : 48 2025-05-07T19:43:04.5913052Z core id : 20 2025-05-07T19:43:04.5913262Z cpu cores : 24 2025-05-07T19:43:04.5913473Z apicid : 40 2025-05-07T19:43:04.5913692Z initial apicid : 40 2025-05-07T19:43:04.5913915Z fpu : yes 2025-05-07T19:43:04.5914148Z fpu_exception : yes 2025-05-07T19:43:04.5914376Z cpuid level : 13 2025-05-07T19:43:04.5914609Z wp : yes 2025-05-07T19:43:04.5917001Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5919842Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5920441Z bogomips : 6000.01 2025-05-07T19:43:04.5920681Z clflush size : 64 2025-05-07T19:43:04.5920912Z cache_alignment : 64 2025-05-07T19:43:04.5921208Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5921540Z power management: 2025-05-07T19:43:04.5921807Z 2025-05-07T19:43:04.5921887Z processor : 21 2025-05-07T19:43:04.5922100Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5922357Z cpu family : 6 2025-05-07T19:43:04.5922555Z model : 85 2025-05-07T19:43:04.5922839Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5923185Z stepping : 7 2025-05-07T19:43:04.5923384Z microcode : 0x5003901 2025-05-07T19:43:04.5923609Z cpu MHz : 1200.063 2025-05-07T19:43:04.5923795Z cache size : 36608 KB 2025-05-07T19:43:04.5924020Z physical id : 0 2025-05-07T19:43:04.5924213Z siblings : 48 2025-05-07T19:43:04.5924409Z core id : 21 2025-05-07T19:43:04.5924585Z cpu cores : 24 2025-05-07T19:43:04.5924798Z apicid : 42 2025-05-07T19:43:04.5924992Z initial apicid : 42 2025-05-07T19:43:04.5925210Z fpu : yes 2025-05-07T19:43:04.5925397Z fpu_exception : yes 2025-05-07T19:43:04.5925621Z cpuid level : 13 2025-05-07T19:43:04.5925818Z wp : yes 2025-05-07T19:43:04.5928081Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5930649Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5931223Z bogomips : 6000.01 2025-05-07T19:43:04.5931440Z clflush size : 64 2025-05-07T19:43:04.5931660Z cache_alignment : 64 2025-05-07T19:43:04.5931923Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5932254Z power management: 2025-05-07T19:43:04.5932379Z 2025-05-07T19:43:04.5932462Z processor : 22 2025-05-07T19:43:04.5932688Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5932917Z cpu family : 6 2025-05-07T19:43:04.5933134Z model : 85 2025-05-07T19:43:04.5933386Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5933734Z stepping : 7 2025-05-07T19:43:04.5933945Z microcode : 0x5003901 2025-05-07T19:43:04.5934163Z cpu MHz : 3000.008 2025-05-07T19:43:04.5934381Z cache size : 36608 KB 2025-05-07T19:43:04.5934592Z physical id : 0 2025-05-07T19:43:04.5934803Z siblings : 48 2025-05-07T19:43:04.5934997Z core id : 22 2025-05-07T19:43:04.5935213Z cpu cores : 24 2025-05-07T19:43:04.5935406Z apicid : 44 2025-05-07T19:43:04.5935611Z initial apicid : 44 2025-05-07T19:43:04.5935810Z fpu : yes 2025-05-07T19:43:04.5936012Z fpu_exception : yes 2025-05-07T19:43:04.5936213Z cpuid level : 13 2025-05-07T19:43:04.5936426Z wp : yes 2025-05-07T19:43:04.5938629Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5941240Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5941797Z bogomips : 6000.01 2025-05-07T19:43:04.5942017Z clflush size : 64 2025-05-07T19:43:04.5942224Z cache_alignment : 64 2025-05-07T19:43:04.5942503Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5942820Z power management: 2025-05-07T19:43:04.5942962Z 2025-05-07T19:43:04.5943044Z processor : 23 2025-05-07T19:43:04.5943256Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5943516Z cpu family : 6 2025-05-07T19:43:04.5943712Z model : 85 2025-05-07T19:43:04.5943995Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5944340Z stepping : 7 2025-05-07T19:43:04.5944541Z microcode : 0x5003901 2025-05-07T19:43:04.5944768Z cpu MHz : 1200.205 2025-05-07T19:43:04.5944977Z cache size : 36608 KB 2025-05-07T19:43:04.5945206Z physical id : 0 2025-05-07T19:43:04.5945399Z siblings : 48 2025-05-07T19:43:04.5945613Z core id : 23 2025-05-07T19:43:04.5945803Z cpu cores : 24 2025-05-07T19:43:04.5946012Z apicid : 46 2025-05-07T19:43:04.5946212Z initial apicid : 46 2025-05-07T19:43:04.5946439Z fpu : yes 2025-05-07T19:43:04.5946626Z fpu_exception : yes 2025-05-07T19:43:04.5946849Z cpuid level : 13 2025-05-07T19:43:04.5947045Z wp : yes 2025-05-07T19:43:04.5949391Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5952335Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5952962Z bogomips : 6000.01 2025-05-07T19:43:04.5953186Z clflush size : 64 2025-05-07T19:43:04.5953432Z cache_alignment : 64 2025-05-07T19:43:04.5953714Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5954070Z power management: 2025-05-07T19:43:04.5954212Z 2025-05-07T19:43:04.5954305Z processor : 24 2025-05-07T19:43:04.5954543Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5954790Z cpu family : 6 2025-05-07T19:43:04.5955019Z model : 85 2025-05-07T19:43:04.5955311Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5955678Z stepping : 7 2025-05-07T19:43:04.5955911Z microcode : 0x5003901 2025-05-07T19:43:04.5956145Z cpu MHz : 3164.566 2025-05-07T19:43:04.5956379Z cache size : 36608 KB 2025-05-07T19:43:04.5956607Z physical id : 1 2025-05-07T19:43:04.5956815Z siblings : 48 2025-05-07T19:43:04.5957004Z core id : 0 2025-05-07T19:43:04.5957221Z cpu cores : 24 2025-05-07T19:43:04.5957425Z apicid : 64 2025-05-07T19:43:04.5957640Z initial apicid : 64 2025-05-07T19:43:04.5957853Z fpu : yes 2025-05-07T19:43:04.5958072Z fpu_exception : yes 2025-05-07T19:43:04.5958290Z cpuid level : 13 2025-05-07T19:43:04.5958501Z wp : yes 2025-05-07T19:43:04.5960878Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5963648Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5964210Z bogomips : 6000.01 2025-05-07T19:43:04.5964459Z clflush size : 64 2025-05-07T19:43:04.5964664Z cache_alignment : 64 2025-05-07T19:43:04.5964926Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5965220Z power management: 2025-05-07T19:43:04.5965340Z 2025-05-07T19:43:04.5965427Z processor : 25 2025-05-07T19:43:04.5965623Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5965850Z cpu family : 6 2025-05-07T19:43:04.5966029Z model : 85 2025-05-07T19:43:04.5966299Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5966625Z stepping : 7 2025-05-07T19:43:04.5966839Z microcode : 0x5003901 2025-05-07T19:43:04.5967044Z cpu MHz : 3194.211 2025-05-07T19:43:04.5967250Z cache size : 36608 KB 2025-05-07T19:43:04.5967469Z physical id : 1 2025-05-07T19:43:04.5967660Z siblings : 48 2025-05-07T19:43:04.5967848Z core id : 1 2025-05-07T19:43:04.5968025Z cpu cores : 24 2025-05-07T19:43:04.5968227Z apicid : 66 2025-05-07T19:43:04.5968408Z initial apicid : 66 2025-05-07T19:43:04.5968614Z fpu : yes 2025-05-07T19:43:04.5968788Z fpu_exception : yes 2025-05-07T19:43:04.5968989Z cpuid level : 13 2025-05-07T19:43:04.5969172Z wp : yes 2025-05-07T19:43:04.5971405Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5973960Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5974522Z bogomips : 6000.01 2025-05-07T19:43:04.5974724Z clflush size : 64 2025-05-07T19:43:04.5974928Z cache_alignment : 64 2025-05-07T19:43:04.5975166Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5975470Z power management: 2025-05-07T19:43:04.5975590Z 2025-05-07T19:43:04.5975664Z processor : 26 2025-05-07T19:43:04.5975884Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5976101Z cpu family : 6 2025-05-07T19:43:04.5976301Z model : 85 2025-05-07T19:43:04.5976547Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5976893Z stepping : 7 2025-05-07T19:43:04.5977084Z microcode : 0x5003901 2025-05-07T19:43:04.5977303Z cpu MHz : 3783.800 2025-05-07T19:43:04.5977513Z cache size : 36608 KB 2025-05-07T19:43:04.5977718Z physical id : 1 2025-05-07T19:43:04.5977912Z siblings : 48 2025-05-07T19:43:04.5978090Z core id : 2 2025-05-07T19:43:04.5978283Z cpu cores : 24 2025-05-07T19:43:04.5978464Z apicid : 68 2025-05-07T19:43:04.5978681Z initial apicid : 68 2025-05-07T19:43:04.5978874Z fpu : yes 2025-05-07T19:43:04.5979079Z fpu_exception : yes 2025-05-07T19:43:04.5979277Z cpuid level : 13 2025-05-07T19:43:04.5979480Z wp : yes 2025-05-07T19:43:04.5981684Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5984290Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5985251Z bogomips : 6000.01 2025-05-07T19:43:04.5985477Z clflush size : 64 2025-05-07T19:43:04.5985712Z cache_alignment : 64 2025-05-07T19:43:04.5985998Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5986328Z power management: 2025-05-07T19:43:04.5986464Z 2025-05-07T19:43:04.5986565Z processor : 27 2025-05-07T19:43:04.5986775Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5987005Z cpu family : 6 2025-05-07T19:43:04.5987200Z model : 85 2025-05-07T19:43:04.5987491Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5987846Z stepping : 7 2025-05-07T19:43:04.5988066Z microcode : 0x5003901 2025-05-07T19:43:04.5988292Z cpu MHz : 3125.842 2025-05-07T19:43:04.5988521Z cache size : 36608 KB 2025-05-07T19:43:04.5988750Z physical id : 1 2025-05-07T19:43:04.5988953Z siblings : 48 2025-05-07T19:43:04.5989231Z core id : 3 2025-05-07T19:43:04.5989424Z cpu cores : 24 2025-05-07T19:43:04.5989645Z apicid : 70 2025-05-07T19:43:04.5989853Z initial apicid : 70 2025-05-07T19:43:04.5990087Z fpu : yes 2025-05-07T19:43:04.5990281Z fpu_exception : yes 2025-05-07T19:43:04.5990504Z cpuid level : 13 2025-05-07T19:43:04.5990713Z wp : yes 2025-05-07T19:43:04.5993205Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.5995968Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.5996577Z bogomips : 6000.01 2025-05-07T19:43:04.5996808Z clflush size : 64 2025-05-07T19:43:04.5997035Z cache_alignment : 64 2025-05-07T19:43:04.5997305Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.5997633Z power management: 2025-05-07T19:43:04.5997765Z 2025-05-07T19:43:04.5997853Z processor : 28 2025-05-07T19:43:04.5998093Z vendor_id : GenuineIntel 2025-05-07T19:43:04.5998322Z cpu family : 6 2025-05-07T19:43:04.5998531Z model : 85 2025-05-07T19:43:04.5998800Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.5999165Z stepping : 7 2025-05-07T19:43:04.5999366Z microcode : 0x5003901 2025-05-07T19:43:04.5999597Z cpu MHz : 3305.174 2025-05-07T19:43:04.5999823Z cache size : 36608 KB 2025-05-07T19:43:04.6000042Z physical id : 1 2025-05-07T19:43:04.6000258Z siblings : 48 2025-05-07T19:43:04.6000462Z core id : 4 2025-05-07T19:43:04.6000662Z cpu cores : 24 2025-05-07T19:43:04.6000859Z apicid : 72 2025-05-07T19:43:04.6001178Z initial apicid : 72 2025-05-07T19:43:04.6001379Z fpu : yes 2025-05-07T19:43:04.6001575Z fpu_exception : yes 2025-05-07T19:43:04.6001779Z cpuid level : 13 2025-05-07T19:43:04.6001982Z wp : yes 2025-05-07T19:43:04.6004175Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6008552Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6009120Z bogomips : 6000.01 2025-05-07T19:43:04.6009324Z clflush size : 64 2025-05-07T19:43:04.6009533Z cache_alignment : 64 2025-05-07T19:43:04.6009802Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6010134Z power management: 2025-05-07T19:43:04.6010265Z 2025-05-07T19:43:04.6010379Z processor : 29 2025-05-07T19:43:04.6010608Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6010875Z cpu family : 6 2025-05-07T19:43:04.6011086Z model : 85 2025-05-07T19:43:04.6011386Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6011728Z stepping : 7 2025-05-07T19:43:04.6011965Z microcode : 0x5003901 2025-05-07T19:43:04.6012196Z cpu MHz : 3000.008 2025-05-07T19:43:04.6012446Z cache size : 36608 KB 2025-05-07T19:43:04.6012695Z physical id : 1 2025-05-07T19:43:04.6012904Z siblings : 48 2025-05-07T19:43:04.6013139Z core id : 5 2025-05-07T19:43:04.6013342Z cpu cores : 24 2025-05-07T19:43:04.6013566Z apicid : 74 2025-05-07T19:43:04.6013769Z initial apicid : 74 2025-05-07T19:43:04.6014004Z fpu : yes 2025-05-07T19:43:04.6014209Z fpu_exception : yes 2025-05-07T19:43:04.6014446Z cpuid level : 13 2025-05-07T19:43:04.6014662Z wp : yes 2025-05-07T19:43:04.6016972Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6019565Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6020143Z bogomips : 6000.01 2025-05-07T19:43:04.6020388Z clflush size : 64 2025-05-07T19:43:04.6020703Z cache_alignment : 64 2025-05-07T19:43:04.6020974Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6021338Z power management: 2025-05-07T19:43:04.6021474Z 2025-05-07T19:43:04.6021566Z processor : 30 2025-05-07T19:43:04.6021811Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6022052Z cpu family : 6 2025-05-07T19:43:04.6022286Z model : 85 2025-05-07T19:43:04.6022561Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6022934Z stepping : 7 2025-05-07T19:43:04.6023144Z microcode : 0x5003901 2025-05-07T19:43:04.6023400Z cpu MHz : 3000.008 2025-05-07T19:43:04.6023648Z cache size : 36608 KB 2025-05-07T19:43:04.6023875Z physical id : 1 2025-05-07T19:43:04.6024113Z siblings : 48 2025-05-07T19:43:04.6024325Z core id : 6 2025-05-07T19:43:04.6024551Z cpu cores : 24 2025-05-07T19:43:04.6024750Z apicid : 76 2025-05-07T19:43:04.6024977Z initial apicid : 76 2025-05-07T19:43:04.6025198Z fpu : yes 2025-05-07T19:43:04.6025419Z fpu_exception : yes 2025-05-07T19:43:04.6025636Z cpuid level : 13 2025-05-07T19:43:04.6025873Z wp : yes 2025-05-07T19:43:04.6028108Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6031077Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6031724Z bogomips : 6000.01 2025-05-07T19:43:04.6031959Z clflush size : 64 2025-05-07T19:43:04.6032221Z cache_alignment : 64 2025-05-07T19:43:04.6032540Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6032895Z power management: 2025-05-07T19:43:04.6033038Z 2025-05-07T19:43:04.6033142Z processor : 31 2025-05-07T19:43:04.6033383Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6033669Z cpu family : 6 2025-05-07T19:43:04.6033886Z model : 85 2025-05-07T19:43:04.6034204Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6034578Z stepping : 7 2025-05-07T19:43:04.6034824Z microcode : 0x5003901 2025-05-07T19:43:04.6035066Z cpu MHz : 3000.008 2025-05-07T19:43:04.6035323Z cache size : 36608 KB 2025-05-07T19:43:04.6035591Z physical id : 1 2025-05-07T19:43:04.6035819Z siblings : 48 2025-05-07T19:43:04.6036055Z core id : 7 2025-05-07T19:43:04.6036273Z cpu cores : 24 2025-05-07T19:43:04.6036511Z apicid : 78 2025-05-07T19:43:04.6036737Z initial apicid : 78 2025-05-07T19:43:04.6036985Z fpu : yes 2025-05-07T19:43:04.6037206Z fpu_exception : yes 2025-05-07T19:43:04.6037458Z cpuid level : 13 2025-05-07T19:43:04.6037683Z wp : yes 2025-05-07T19:43:04.6040166Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6042983Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6043561Z bogomips : 6000.01 2025-05-07T19:43:04.6043814Z clflush size : 64 2025-05-07T19:43:04.6044071Z cache_alignment : 64 2025-05-07T19:43:04.6044347Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6044696Z power management: 2025-05-07T19:43:04.6044831Z 2025-05-07T19:43:04.6044921Z processor : 32 2025-05-07T19:43:04.6045167Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6045406Z cpu family : 6 2025-05-07T19:43:04.6045632Z model : 85 2025-05-07T19:43:04.6045907Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6046274Z stepping : 7 2025-05-07T19:43:04.6046489Z microcode : 0x5003901 2025-05-07T19:43:04.6046734Z cpu MHz : 3151.646 2025-05-07T19:43:04.6046957Z cache size : 36608 KB 2025-05-07T19:43:04.6047167Z physical id : 1 2025-05-07T19:43:04.6047378Z siblings : 48 2025-05-07T19:43:04.6047567Z core id : 8 2025-05-07T19:43:04.6047770Z cpu cores : 24 2025-05-07T19:43:04.6047965Z apicid : 80 2025-05-07T19:43:04.6048171Z initial apicid : 80 2025-05-07T19:43:04.6048371Z fpu : yes 2025-05-07T19:43:04.6048576Z fpu_exception : yes 2025-05-07T19:43:04.6048780Z cpuid level : 13 2025-05-07T19:43:04.6048988Z wp : yes 2025-05-07T19:43:04.6051190Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6053726Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6054350Z bogomips : 6000.01 2025-05-07T19:43:04.6054566Z clflush size : 64 2025-05-07T19:43:04.6054768Z cache_alignment : 64 2025-05-07T19:43:04.6055035Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6055344Z power management: 2025-05-07T19:43:04.6055468Z 2025-05-07T19:43:04.6055562Z processor : 33 2025-05-07T19:43:04.6055768Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6056007Z cpu family : 6 2025-05-07T19:43:04.6056197Z model : 85 2025-05-07T19:43:04.6056478Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6056805Z stepping : 7 2025-05-07T19:43:04.6057018Z microcode : 0x5003901 2025-05-07T19:43:04.6057227Z cpu MHz : 3102.447 2025-05-07T19:43:04.6057443Z cache size : 36608 KB 2025-05-07T19:43:04.6057666Z physical id : 1 2025-05-07T19:43:04.6057860Z siblings : 48 2025-05-07T19:43:04.6058065Z core id : 9 2025-05-07T19:43:04.6058252Z cpu cores : 24 2025-05-07T19:43:04.6058458Z apicid : 82 2025-05-07T19:43:04.6058647Z initial apicid : 82 2025-05-07T19:43:04.6058855Z fpu : yes 2025-05-07T19:43:04.6059048Z fpu_exception : yes 2025-05-07T19:43:04.6059261Z cpuid level : 13 2025-05-07T19:43:04.6059457Z wp : yes 2025-05-07T19:43:04.6061662Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6064259Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6064814Z bogomips : 6000.01 2025-05-07T19:43:04.6065045Z clflush size : 64 2025-05-07T19:43:04.6065260Z cache_alignment : 64 2025-05-07T19:43:04.6065509Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6065819Z power management: 2025-05-07T19:43:04.6065941Z 2025-05-07T19:43:04.6066015Z processor : 34 2025-05-07T19:43:04.6066237Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6066463Z cpu family : 6 2025-05-07T19:43:04.6066661Z model : 85 2025-05-07T19:43:04.6066915Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6067263Z stepping : 7 2025-05-07T19:43:04.6067458Z microcode : 0x5003901 2025-05-07T19:43:04.6067687Z cpu MHz : 3000.008 2025-05-07T19:43:04.6067899Z cache size : 36608 KB 2025-05-07T19:43:04.6068105Z physical id : 1 2025-05-07T19:43:04.6068310Z siblings : 48 2025-05-07T19:43:04.6068493Z core id : 10 2025-05-07T19:43:04.6068688Z cpu cores : 24 2025-05-07T19:43:04.6068875Z apicid : 84 2025-05-07T19:43:04.6069142Z initial apicid : 84 2025-05-07T19:43:04.6069354Z fpu : yes 2025-05-07T19:43:04.6069729Z fpu_exception : yes 2025-05-07T19:43:04.6069949Z cpuid level : 13 2025-05-07T19:43:04.6070047Z wp : yes 2025-05-07T19:43:04.6072318Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6072719Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6072825Z bogomips : 6000.01 2025-05-07T19:43:04.6072914Z clflush size : 64 2025-05-07T19:43:04.6073076Z cache_alignment : 64 2025-05-07T19:43:04.6073232Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6073322Z power management: 2025-05-07T19:43:04.6073327Z 2025-05-07T19:43:04.6073413Z processor : 35 2025-05-07T19:43:04.6073525Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6073604Z cpu family : 6 2025-05-07T19:43:04.6073683Z model : 85 2025-05-07T19:43:04.6073849Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6073941Z stepping : 7 2025-05-07T19:43:04.6074028Z microcode : 0x5003901 2025-05-07T19:43:04.6074112Z cpu MHz : 3000.008 2025-05-07T19:43:04.6074218Z cache size : 36608 KB 2025-05-07T19:43:04.6074303Z physical id : 1 2025-05-07T19:43:04.6074387Z siblings : 48 2025-05-07T19:43:04.6074465Z core id : 11 2025-05-07T19:43:04.6074554Z cpu cores : 24 2025-05-07T19:43:04.6074635Z apicid : 86 2025-05-07T19:43:04.6074730Z initial apicid : 86 2025-05-07T19:43:04.6074816Z fpu : yes 2025-05-07T19:43:04.6074920Z fpu_exception : yes 2025-05-07T19:43:04.6075013Z cpuid level : 13 2025-05-07T19:43:04.6075095Z wp : yes 2025-05-07T19:43:04.6077360Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6077885Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:04.6078349Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6078440Z bogomips : 6000.01 2025-05-07T19:43:04.6078527Z clflush size : 64 2025-05-07T19:43:04.6078616Z cache_alignment : 64 2025-05-07T19:43:04.6078762Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6078846Z power management: 2025-05-07T19:43:04.6078850Z 2025-05-07T19:43:04.6078938Z processor : 36 2025-05-07T19:43:04.6079057Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6079137Z cpu family : 6 2025-05-07T19:43:04.6079218Z model : 85 2025-05-07T19:43:04.6079381Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6079473Z stepping : 7 2025-05-07T19:43:04.6079560Z microcode : 0x5003901 2025-05-07T19:43:04.6079639Z cpu MHz : 3000.008 2025-05-07T19:43:04.6079742Z cache size : 36608 KB 2025-05-07T19:43:04.6079826Z physical id : 1 2025-05-07T19:43:04.6079905Z siblings : 48 2025-05-07T19:43:04.6079983Z core id : 12 2025-05-07T19:43:04.6080077Z cpu cores : 24 2025-05-07T19:43:04.6080160Z apicid : 88 2025-05-07T19:43:04.6080247Z initial apicid : 88 2025-05-07T19:43:04.6080345Z fpu : yes 2025-05-07T19:43:04.6080436Z fpu_exception : yes 2025-05-07T19:43:04.6080518Z cpuid level : 13 2025-05-07T19:43:04.6080598Z wp : yes 2025-05-07T19:43:04.6082909Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6083279Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6083371Z bogomips : 6000.01 2025-05-07T19:43:04.6083452Z clflush size : 64 2025-05-07T19:43:04.6083586Z cache_alignment : 64 2025-05-07T19:43:04.6083708Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6083801Z power management: 2025-05-07T19:43:04.6083805Z 2025-05-07T19:43:04.6083889Z processor : 37 2025-05-07T19:43:04.6083979Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6084056Z cpu family : 6 2025-05-07T19:43:04.6084132Z model : 85 2025-05-07T19:43:04.6084288Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6084501Z stepping : 7 2025-05-07T19:43:04.6084588Z microcode : 0x5003901 2025-05-07T19:43:04.6084664Z cpu MHz : 3000.008 2025-05-07T19:43:04.6084744Z cache size : 36608 KB 2025-05-07T19:43:04.6085002Z physical id : 1 2025-05-07T19:43:04.6085082Z siblings : 48 2025-05-07T19:43:04.6085166Z core id : 13 2025-05-07T19:43:04.6085252Z cpu cores : 24 2025-05-07T19:43:04.6085347Z apicid : 90 2025-05-07T19:43:04.6085553Z initial apicid : 90 2025-05-07T19:43:04.6085638Z fpu : yes 2025-05-07T19:43:04.6085730Z fpu_exception : yes 2025-05-07T19:43:04.6085836Z cpuid level : 13 2025-05-07T19:43:04.6085922Z wp : yes 2025-05-07T19:43:04.6088183Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6088597Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6088687Z bogomips : 6000.01 2025-05-07T19:43:04.6088849Z clflush size : 64 2025-05-07T19:43:04.6088953Z cache_alignment : 64 2025-05-07T19:43:04.6089090Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6089181Z power management: 2025-05-07T19:43:04.6089186Z 2025-05-07T19:43:04.6089288Z processor : 38 2025-05-07T19:43:04.6089385Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6089473Z cpu family : 6 2025-05-07T19:43:04.6089568Z model : 85 2025-05-07T19:43:04.6089734Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6089821Z stepping : 7 2025-05-07T19:43:04.6089907Z microcode : 0x5003901 2025-05-07T19:43:04.6090003Z cpu MHz : 3000.008 2025-05-07T19:43:04.6090087Z cache size : 36608 KB 2025-05-07T19:43:04.6090172Z physical id : 1 2025-05-07T19:43:04.6090262Z siblings : 48 2025-05-07T19:43:04.6090352Z core id : 14 2025-05-07T19:43:04.6090435Z cpu cores : 24 2025-05-07T19:43:04.6090521Z apicid : 92 2025-05-07T19:43:04.6090619Z initial apicid : 92 2025-05-07T19:43:04.6090697Z fpu : yes 2025-05-07T19:43:04.6090795Z fpu_exception : yes 2025-05-07T19:43:04.6090885Z cpuid level : 13 2025-05-07T19:43:04.6090975Z wp : yes 2025-05-07T19:43:04.6093233Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6093644Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6093729Z bogomips : 6000.01 2025-05-07T19:43:04.6093812Z clflush size : 64 2025-05-07T19:43:04.6093904Z cache_alignment : 64 2025-05-07T19:43:04.6094057Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6094204Z power management: 2025-05-07T19:43:04.6094209Z 2025-05-07T19:43:04.6094293Z processor : 39 2025-05-07T19:43:04.6094405Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6094487Z cpu family : 6 2025-05-07T19:43:04.6094564Z model : 85 2025-05-07T19:43:04.6094729Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6094824Z stepping : 7 2025-05-07T19:43:04.6094909Z microcode : 0x5003901 2025-05-07T19:43:04.6094990Z cpu MHz : 3000.008 2025-05-07T19:43:04.6095087Z cache size : 36608 KB 2025-05-07T19:43:04.6095171Z physical id : 1 2025-05-07T19:43:04.6095249Z siblings : 48 2025-05-07T19:43:04.6095324Z core id : 15 2025-05-07T19:43:04.6095426Z cpu cores : 24 2025-05-07T19:43:04.6095505Z apicid : 94 2025-05-07T19:43:04.6095596Z initial apicid : 94 2025-05-07T19:43:04.6095690Z fpu : yes 2025-05-07T19:43:04.6095781Z fpu_exception : yes 2025-05-07T19:43:04.6095866Z cpuid level : 13 2025-05-07T19:43:04.6095951Z wp : yes 2025-05-07T19:43:04.6098344Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6098714Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6098808Z bogomips : 6000.01 2025-05-07T19:43:04.6098891Z clflush size : 64 2025-05-07T19:43:04.6098966Z cache_alignment : 64 2025-05-07T19:43:04.6099134Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6099230Z power management: 2025-05-07T19:43:04.6099238Z 2025-05-07T19:43:04.6099319Z processor : 40 2025-05-07T19:43:04.6099402Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6099490Z cpu family : 6 2025-05-07T19:43:04.6099563Z model : 85 2025-05-07T19:43:04.6099712Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6099787Z stepping : 7 2025-05-07T19:43:04.6099882Z microcode : 0x5003901 2025-05-07T19:43:04.6099965Z cpu MHz : 3000.008 2025-05-07T19:43:04.6100046Z cache size : 36608 KB 2025-05-07T19:43:04.6100140Z physical id : 1 2025-05-07T19:43:04.6100218Z siblings : 48 2025-05-07T19:43:04.6100293Z core id : 16 2025-05-07T19:43:04.6100367Z cpu cores : 24 2025-05-07T19:43:04.6100451Z apicid : 96 2025-05-07T19:43:04.6100528Z initial apicid : 96 2025-05-07T19:43:04.6100599Z fpu : yes 2025-05-07T19:43:04.6100691Z fpu_exception : yes 2025-05-07T19:43:04.6100770Z cpuid level : 13 2025-05-07T19:43:04.6100848Z wp : yes 2025-05-07T19:43:04.6102943Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6103316Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6103400Z bogomips : 6000.01 2025-05-07T19:43:04.6103494Z clflush size : 64 2025-05-07T19:43:04.6103576Z cache_alignment : 64 2025-05-07T19:43:04.6103699Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6103785Z power management: 2025-05-07T19:43:04.6103789Z 2025-05-07T19:43:04.6103924Z processor : 41 2025-05-07T19:43:04.6104007Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6104082Z cpu family : 6 2025-05-07T19:43:04.6104162Z model : 85 2025-05-07T19:43:04.6104308Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6104387Z stepping : 7 2025-05-07T19:43:04.6104466Z microcode : 0x5003901 2025-05-07T19:43:04.6104552Z cpu MHz : 3148.909 2025-05-07T19:43:04.6104631Z cache size : 36608 KB 2025-05-07T19:43:04.6104707Z physical id : 1 2025-05-07T19:43:04.6104809Z siblings : 48 2025-05-07T19:43:04.6104881Z core id : 17 2025-05-07T19:43:04.6104955Z cpu cores : 24 2025-05-07T19:43:04.6105031Z apicid : 98 2025-05-07T19:43:04.6105126Z initial apicid : 98 2025-05-07T19:43:04.6105199Z fpu : yes 2025-05-07T19:43:04.6105282Z fpu_exception : yes 2025-05-07T19:43:04.6105374Z cpuid level : 13 2025-05-07T19:43:04.6105446Z wp : yes 2025-05-07T19:43:04.6107535Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6107917Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6107993Z bogomips : 6000.01 2025-05-07T19:43:04.6108072Z clflush size : 64 2025-05-07T19:43:04.6108167Z cache_alignment : 64 2025-05-07T19:43:04.6108290Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6108371Z power management: 2025-05-07T19:43:04.6108429Z 2025-05-07T19:43:04.6108506Z processor : 42 2025-05-07T19:43:04.6108609Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6108688Z cpu family : 6 2025-05-07T19:43:04.6108760Z model : 85 2025-05-07T19:43:04.6108928Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6109007Z stepping : 7 2025-05-07T19:43:04.6109154Z microcode : 0x5003901 2025-05-07T19:43:04.6109232Z cpu MHz : 3178.214 2025-05-07T19:43:04.6109337Z cache size : 36608 KB 2025-05-07T19:43:04.6109422Z physical id : 1 2025-05-07T19:43:04.6109499Z siblings : 48 2025-05-07T19:43:04.6109763Z core id : 18 2025-05-07T19:43:04.6109848Z cpu cores : 24 2025-05-07T19:43:04.6109930Z apicid : 100 2025-05-07T19:43:04.6110016Z initial apicid : 100 2025-05-07T19:43:04.6110112Z fpu : yes 2025-05-07T19:43:04.6110203Z fpu_exception : yes 2025-05-07T19:43:04.6110285Z cpuid level : 13 2025-05-07T19:43:04.6110364Z wp : yes 2025-05-07T19:43:04.6112634Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6113049Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6113140Z bogomips : 6000.01 2025-05-07T19:43:04.6113231Z clflush size : 64 2025-05-07T19:43:04.6113319Z cache_alignment : 64 2025-05-07T19:43:04.6113469Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6113556Z power management: 2025-05-07T19:43:04.6113561Z 2025-05-07T19:43:04.6113651Z processor : 43 2025-05-07T19:43:04.6113743Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6113889Z cpu family : 6 2025-05-07T19:43:04.6113968Z model : 85 2025-05-07T19:43:04.6114134Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6114229Z stepping : 7 2025-05-07T19:43:04.6114311Z microcode : 0x5003901 2025-05-07T19:43:04.6114388Z cpu MHz : 3062.312 2025-05-07T19:43:04.6114467Z cache size : 36608 KB 2025-05-07T19:43:04.6114566Z physical id : 1 2025-05-07T19:43:04.6114643Z siblings : 48 2025-05-07T19:43:04.6114725Z core id : 19 2025-05-07T19:43:04.6114823Z cpu cores : 24 2025-05-07T19:43:04.6114901Z apicid : 102 2025-05-07T19:43:04.6114983Z initial apicid : 102 2025-05-07T19:43:04.6115063Z fpu : yes 2025-05-07T19:43:04.6115162Z fpu_exception : yes 2025-05-07T19:43:04.6115245Z cpuid level : 13 2025-05-07T19:43:04.6115324Z wp : yes 2025-05-07T19:43:04.6117608Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6118015Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6118105Z bogomips : 6000.01 2025-05-07T19:43:04.6118201Z clflush size : 64 2025-05-07T19:43:04.6118285Z cache_alignment : 64 2025-05-07T19:43:04.6118420Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6118537Z power management: 2025-05-07T19:43:04.6118541Z 2025-05-07T19:43:04.6118622Z processor : 44 2025-05-07T19:43:04.6118764Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6118847Z cpu family : 6 2025-05-07T19:43:04.6118939Z model : 85 2025-05-07T19:43:04.6119103Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6119189Z stepping : 7 2025-05-07T19:43:04.6119299Z microcode : 0x5003901 2025-05-07T19:43:04.6119382Z cpu MHz : 3111.037 2025-05-07T19:43:04.6119464Z cache size : 36608 KB 2025-05-07T19:43:04.6119551Z physical id : 1 2025-05-07T19:43:04.6119645Z siblings : 48 2025-05-07T19:43:04.6119724Z core id : 20 2025-05-07T19:43:04.6119806Z cpu cores : 24 2025-05-07T19:43:04.6119887Z apicid : 104 2025-05-07T19:43:04.6119988Z initial apicid : 104 2025-05-07T19:43:04.6120063Z fpu : yes 2025-05-07T19:43:04.6120150Z fpu_exception : yes 2025-05-07T19:43:04.6120250Z cpuid level : 13 2025-05-07T19:43:04.6120328Z wp : yes 2025-05-07T19:43:04.6122630Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6123020Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6123103Z bogomips : 6000.01 2025-05-07T19:43:04.6123178Z clflush size : 64 2025-05-07T19:43:04.6123285Z cache_alignment : 64 2025-05-07T19:43:04.6123403Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6123485Z power management: 2025-05-07T19:43:04.6123489Z 2025-05-07T19:43:04.6123580Z processor : 45 2025-05-07T19:43:04.6123659Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6123733Z cpu family : 6 2025-05-07T19:43:04.6123807Z model : 85 2025-05-07T19:43:04.6123963Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6124081Z stepping : 7 2025-05-07T19:43:04.6124165Z microcode : 0x5003901 2025-05-07T19:43:04.6124269Z cpu MHz : 3000.008 2025-05-07T19:43:04.6124350Z cache size : 36608 KB 2025-05-07T19:43:04.6124432Z physical id : 1 2025-05-07T19:43:04.6124509Z siblings : 48 2025-05-07T19:43:04.6124593Z core id : 21 2025-05-07T19:43:04.6124672Z cpu cores : 24 2025-05-07T19:43:04.6124745Z apicid : 106 2025-05-07T19:43:04.6124821Z initial apicid : 106 2025-05-07T19:43:04.6124906Z fpu : yes 2025-05-07T19:43:04.6124989Z fpu_exception : yes 2025-05-07T19:43:04.6125072Z cpuid level : 13 2025-05-07T19:43:04.6125155Z wp : yes 2025-05-07T19:43:04.6127237Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6127603Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6127694Z bogomips : 6000.01 2025-05-07T19:43:04.6127769Z clflush size : 64 2025-05-07T19:43:04.6127854Z cache_alignment : 64 2025-05-07T19:43:04.6127978Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6128054Z power management: 2025-05-07T19:43:04.6128058Z 2025-05-07T19:43:04.6128133Z processor : 46 2025-05-07T19:43:04.6128235Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6128316Z cpu family : 6 2025-05-07T19:43:04.6128386Z model : 85 2025-05-07T19:43:04.6128594Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6128682Z stepping : 7 2025-05-07T19:43:04.6128763Z microcode : 0x5003901 2025-05-07T19:43:04.6128833Z cpu MHz : 3000.008 2025-05-07T19:43:04.6128917Z cache size : 36608 KB 2025-05-07T19:43:04.6129008Z physical id : 1 2025-05-07T19:43:04.6129079Z siblings : 48 2025-05-07T19:43:04.6129158Z core id : 22 2025-05-07T19:43:04.6129253Z cpu cores : 24 2025-05-07T19:43:04.6129320Z apicid : 108 2025-05-07T19:43:04.6129406Z initial apicid : 108 2025-05-07T19:43:04.6129479Z fpu : yes 2025-05-07T19:43:04.6129571Z fpu_exception : yes 2025-05-07T19:43:04.6129649Z cpuid level : 13 2025-05-07T19:43:04.6129716Z wp : yes 2025-05-07T19:43:04.6131813Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6132181Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6132258Z bogomips : 6000.01 2025-05-07T19:43:04.6132340Z clflush size : 64 2025-05-07T19:43:04.6132422Z cache_alignment : 64 2025-05-07T19:43:04.6132541Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6132640Z power management: 2025-05-07T19:43:04.6132645Z 2025-05-07T19:43:04.6132716Z processor : 47 2025-05-07T19:43:04.6132804Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6132882Z cpu family : 6 2025-05-07T19:43:04.6132972Z model : 85 2025-05-07T19:43:04.6133126Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6133203Z stepping : 7 2025-05-07T19:43:04.6133351Z microcode : 0x5003901 2025-05-07T19:43:04.6133436Z cpu MHz : 3000.008 2025-05-07T19:43:04.6133516Z cache size : 36608 KB 2025-05-07T19:43:04.6133594Z physical id : 1 2025-05-07T19:43:04.6133683Z siblings : 48 2025-05-07T19:43:04.6133767Z core id : 23 2025-05-07T19:43:04.6133850Z cpu cores : 24 2025-05-07T19:43:04.6133939Z apicid : 110 2025-05-07T19:43:04.6134016Z initial apicid : 110 2025-05-07T19:43:04.6134092Z fpu : yes 2025-05-07T19:43:04.6134173Z fpu_exception : yes 2025-05-07T19:43:04.6134261Z cpuid level : 13 2025-05-07T19:43:04.6134329Z wp : yes 2025-05-07T19:43:04.6136417Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6136797Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6136871Z bogomips : 6000.01 2025-05-07T19:43:04.6136944Z clflush size : 64 2025-05-07T19:43:04.6137036Z cache_alignment : 64 2025-05-07T19:43:04.6137152Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6137227Z power management: 2025-05-07T19:43:04.6137231Z 2025-05-07T19:43:04.6137307Z processor : 48 2025-05-07T19:43:04.6137387Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6137455Z cpu family : 6 2025-05-07T19:43:04.6137520Z model : 85 2025-05-07T19:43:04.6137671Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6137789Z stepping : 7 2025-05-07T19:43:04.6137864Z microcode : 0x5003901 2025-05-07T19:43:04.6137965Z cpu MHz : 1200.699 2025-05-07T19:43:04.6138045Z cache size : 36608 KB 2025-05-07T19:43:04.6138125Z physical id : 0 2025-05-07T19:43:04.6138201Z siblings : 48 2025-05-07T19:43:04.6138291Z core id : 0 2025-05-07T19:43:04.6138366Z cpu cores : 24 2025-05-07T19:43:04.6138439Z apicid : 1 2025-05-07T19:43:04.6138543Z initial apicid : 1 2025-05-07T19:43:04.6138617Z fpu : yes 2025-05-07T19:43:04.6138695Z fpu_exception : yes 2025-05-07T19:43:04.6138775Z cpuid level : 13 2025-05-07T19:43:04.6138871Z wp : yes 2025-05-07T19:43:04.6140953Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6141341Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6141426Z bogomips : 6000.01 2025-05-07T19:43:04.6141506Z clflush size : 64 2025-05-07T19:43:04.6141588Z cache_alignment : 64 2025-05-07T19:43:04.6141728Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6141809Z power management: 2025-05-07T19:43:04.6141814Z 2025-05-07T19:43:04.6141889Z processor : 49 2025-05-07T19:43:04.6141995Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6142077Z cpu family : 6 2025-05-07T19:43:04.6142155Z model : 85 2025-05-07T19:43:04.6142305Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6142400Z stepping : 7 2025-05-07T19:43:04.6142493Z microcode : 0x5003901 2025-05-07T19:43:04.6142573Z cpu MHz : 3000.008 2025-05-07T19:43:04.6142717Z cache size : 36608 KB 2025-05-07T19:43:04.6142797Z physical id : 0 2025-05-07T19:43:04.6142878Z siblings : 48 2025-05-07T19:43:04.6142952Z core id : 1 2025-05-07T19:43:04.6143045Z cpu cores : 24 2025-05-07T19:43:04.6143126Z apicid : 3 2025-05-07T19:43:04.6143210Z initial apicid : 3 2025-05-07T19:43:04.6143298Z fpu : yes 2025-05-07T19:43:04.6143382Z fpu_exception : yes 2025-05-07T19:43:04.6143460Z cpuid level : 13 2025-05-07T19:43:04.6143535Z wp : yes 2025-05-07T19:43:04.6145642Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6146021Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6146120Z bogomips : 6000.01 2025-05-07T19:43:04.6146197Z clflush size : 64 2025-05-07T19:43:04.6146282Z cache_alignment : 64 2025-05-07T19:43:04.6146406Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6146508Z power management: 2025-05-07T19:43:04.6146512Z 2025-05-07T19:43:04.6146588Z processor : 50 2025-05-07T19:43:04.6146676Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6146776Z cpu family : 6 2025-05-07T19:43:04.6146851Z model : 85 2025-05-07T19:43:04.6147003Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6147084Z stepping : 7 2025-05-07T19:43:04.6147175Z microcode : 0x5003901 2025-05-07T19:43:04.6147326Z cpu MHz : 3000.008 2025-05-07T19:43:04.6147408Z cache size : 36608 KB 2025-05-07T19:43:04.6147503Z physical id : 0 2025-05-07T19:43:04.6147580Z siblings : 48 2025-05-07T19:43:04.6147653Z core id : 2 2025-05-07T19:43:04.6147729Z cpu cores : 24 2025-05-07T19:43:04.6147820Z apicid : 5 2025-05-07T19:43:04.6147905Z initial apicid : 5 2025-05-07T19:43:04.6147983Z fpu : yes 2025-05-07T19:43:04.6148066Z fpu_exception : yes 2025-05-07T19:43:04.6148161Z cpuid level : 13 2025-05-07T19:43:04.6148233Z wp : yes 2025-05-07T19:43:04.6150636Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6151059Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6151140Z bogomips : 6000.01 2025-05-07T19:43:04.6151248Z clflush size : 64 2025-05-07T19:43:04.6151333Z cache_alignment : 64 2025-05-07T19:43:04.6151464Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6151556Z power management: 2025-05-07T19:43:04.6151560Z 2025-05-07T19:43:04.6151666Z processor : 51 2025-05-07T19:43:04.6151756Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6151839Z cpu family : 6 2025-05-07T19:43:04.6151937Z model : 85 2025-05-07T19:43:04.6152101Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6152184Z stepping : 7 2025-05-07T19:43:04.6152277Z microcode : 0x5003901 2025-05-07T19:43:04.6152381Z cpu MHz : 1200.221 2025-05-07T19:43:04.6152472Z cache size : 36608 KB 2025-05-07T19:43:04.6152557Z physical id : 0 2025-05-07T19:43:04.6152712Z siblings : 48 2025-05-07T19:43:04.6152797Z core id : 3 2025-05-07T19:43:04.6152880Z cpu cores : 24 2025-05-07T19:43:04.6152964Z apicid : 7 2025-05-07T19:43:04.6153074Z initial apicid : 7 2025-05-07T19:43:04.6153152Z fpu : yes 2025-05-07T19:43:04.6153240Z fpu_exception : yes 2025-05-07T19:43:04.6153325Z cpuid level : 13 2025-05-07T19:43:04.6153423Z wp : yes 2025-05-07T19:43:04.6155683Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6156103Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6156188Z bogomips : 6000.01 2025-05-07T19:43:04.6156278Z clflush size : 64 2025-05-07T19:43:04.6156365Z cache_alignment : 64 2025-05-07T19:43:04.6156504Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6156601Z power management: 2025-05-07T19:43:04.6156606Z 2025-05-07T19:43:04.6156696Z processor : 52 2025-05-07T19:43:04.6156801Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6156883Z cpu family : 6 2025-05-07T19:43:04.6156965Z model : 85 2025-05-07T19:43:04.6157146Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6157229Z stepping : 7 2025-05-07T19:43:04.6157325Z microcode : 0x5003901 2025-05-07T19:43:04.6157412Z cpu MHz : 1200.206 2025-05-07T19:43:04.6157519Z cache size : 36608 KB 2025-05-07T19:43:04.6157658Z physical id : 0 2025-05-07T19:43:04.6157741Z siblings : 48 2025-05-07T19:43:04.6157816Z core id : 4 2025-05-07T19:43:04.6157921Z cpu cores : 24 2025-05-07T19:43:04.6158003Z apicid : 9 2025-05-07T19:43:04.6158098Z initial apicid : 9 2025-05-07T19:43:04.6158197Z fpu : yes 2025-05-07T19:43:04.6158296Z fpu_exception : yes 2025-05-07T19:43:04.6158382Z cpuid level : 13 2025-05-07T19:43:04.6158465Z wp : yes 2025-05-07T19:43:04.6160747Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6161156Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6161260Z bogomips : 6000.01 2025-05-07T19:43:04.6161351Z clflush size : 64 2025-05-07T19:43:04.6161444Z cache_alignment : 64 2025-05-07T19:43:04.6161581Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6161684Z power management: 2025-05-07T19:43:04.6161688Z 2025-05-07T19:43:04.6161882Z processor : 53 2025-05-07T19:43:04.6161975Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6162069Z cpu family : 6 2025-05-07T19:43:04.6162143Z model : 85 2025-05-07T19:43:04.6162297Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6162381Z stepping : 7 2025-05-07T19:43:04.6162477Z microcode : 0x5003901 2025-05-07T19:43:04.6162556Z cpu MHz : 1200.837 2025-05-07T19:43:04.6162639Z cache size : 36608 KB 2025-05-07T19:43:04.6162736Z physical id : 0 2025-05-07T19:43:04.6162814Z siblings : 48 2025-05-07T19:43:04.6162892Z core id : 5 2025-05-07T19:43:04.6162970Z cpu cores : 24 2025-05-07T19:43:04.6163111Z apicid : 11 2025-05-07T19:43:04.6163195Z initial apicid : 11 2025-05-07T19:43:04.6163274Z fpu : yes 2025-05-07T19:43:04.6163374Z fpu_exception : yes 2025-05-07T19:43:04.6163454Z cpuid level : 13 2025-05-07T19:43:04.6163529Z wp : yes 2025-05-07T19:43:04.6165632Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6166009Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6166095Z bogomips : 6000.01 2025-05-07T19:43:04.6166201Z clflush size : 64 2025-05-07T19:43:04.6166288Z cache_alignment : 64 2025-05-07T19:43:04.6166412Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6166499Z power management: 2025-05-07T19:43:04.6166503Z 2025-05-07T19:43:04.6166604Z processor : 54 2025-05-07T19:43:04.6166694Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6166776Z cpu family : 6 2025-05-07T19:43:04.6166874Z model : 85 2025-05-07T19:43:04.6167028Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6167107Z stepping : 7 2025-05-07T19:43:04.6167195Z microcode : 0x5003901 2025-05-07T19:43:04.6167292Z cpu MHz : 1208.895 2025-05-07T19:43:04.6167375Z cache size : 36608 KB 2025-05-07T19:43:04.6167460Z physical id : 0 2025-05-07T19:43:04.6167543Z siblings : 48 2025-05-07T19:43:04.6167617Z core id : 6 2025-05-07T19:43:04.6167741Z cpu cores : 24 2025-05-07T19:43:04.6167819Z apicid : 13 2025-05-07T19:43:04.6167916Z initial apicid : 13 2025-05-07T19:43:04.6167993Z fpu : yes 2025-05-07T19:43:04.6168076Z fpu_exception : yes 2025-05-07T19:43:04.6168177Z cpuid level : 13 2025-05-07T19:43:04.6168247Z wp : yes 2025-05-07T19:43:04.6170330Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6170726Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6170811Z bogomips : 6000.01 2025-05-07T19:43:04.6170884Z clflush size : 64 2025-05-07T19:43:04.6170984Z cache_alignment : 64 2025-05-07T19:43:04.6171110Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6171196Z power management: 2025-05-07T19:43:04.6171201Z 2025-05-07T19:43:04.6171279Z processor : 55 2025-05-07T19:43:04.6171377Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6171456Z cpu family : 6 2025-05-07T19:43:04.6171533Z model : 85 2025-05-07T19:43:04.6171694Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6171773Z stepping : 7 2025-05-07T19:43:04.6171856Z microcode : 0x5003901 2025-05-07T19:43:04.6171933Z cpu MHz : 1210.747 2025-05-07T19:43:04.6172021Z cache size : 36608 KB 2025-05-07T19:43:04.6172095Z physical id : 0 2025-05-07T19:43:04.6172173Z siblings : 48 2025-05-07T19:43:04.6172260Z core id : 7 2025-05-07T19:43:04.6172342Z cpu cores : 24 2025-05-07T19:43:04.6172418Z apicid : 15 2025-05-07T19:43:04.6172500Z initial apicid : 15 2025-05-07T19:43:04.6172634Z fpu : yes 2025-05-07T19:43:04.6172720Z fpu_exception : yes 2025-05-07T19:43:04.6172800Z cpuid level : 13 2025-05-07T19:43:04.6172886Z wp : yes 2025-05-07T19:43:04.6174966Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6175339Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6175436Z bogomips : 6000.01 2025-05-07T19:43:04.6175512Z clflush size : 64 2025-05-07T19:43:04.6175600Z cache_alignment : 64 2025-05-07T19:43:04.6175740Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6175822Z power management: 2025-05-07T19:43:04.6175826Z 2025-05-07T19:43:04.6175904Z processor : 56 2025-05-07T19:43:04.6175992Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6176094Z cpu family : 6 2025-05-07T19:43:04.6176170Z model : 85 2025-05-07T19:43:04.6176328Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6176423Z stepping : 7 2025-05-07T19:43:04.6176505Z microcode : 0x5003901 2025-05-07T19:43:04.6176592Z cpu MHz : 2391.397 2025-05-07T19:43:04.6176678Z cache size : 36608 KB 2025-05-07T19:43:04.6176770Z physical id : 0 2025-05-07T19:43:04.6176852Z siblings : 48 2025-05-07T19:43:04.6176929Z core id : 8 2025-05-07T19:43:04.6177031Z cpu cores : 24 2025-05-07T19:43:04.6177107Z apicid : 17 2025-05-07T19:43:04.6177236Z initial apicid : 17 2025-05-07T19:43:04.6177317Z fpu : yes 2025-05-07T19:43:04.6177416Z fpu_exception : yes 2025-05-07T19:43:04.6177502Z cpuid level : 13 2025-05-07T19:43:04.6177589Z wp : yes 2025-05-07T19:43:04.6179696Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6180064Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6180147Z bogomips : 6000.01 2025-05-07T19:43:04.6180251Z clflush size : 64 2025-05-07T19:43:04.6180327Z cache_alignment : 64 2025-05-07T19:43:04.6180450Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6180546Z power management: 2025-05-07T19:43:04.6180550Z 2025-05-07T19:43:04.6180627Z processor : 57 2025-05-07T19:43:04.6180716Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6180794Z cpu family : 6 2025-05-07T19:43:04.6180884Z model : 85 2025-05-07T19:43:04.6181046Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6181120Z stepping : 7 2025-05-07T19:43:04.6181219Z microcode : 0x5003901 2025-05-07T19:43:04.6181296Z cpu MHz : 1199.100 2025-05-07T19:43:04.6181372Z cache size : 36608 KB 2025-05-07T19:43:04.6181451Z physical id : 0 2025-05-07T19:43:04.6181544Z siblings : 48 2025-05-07T19:43:04.6181627Z core id : 9 2025-05-07T19:43:04.6181709Z cpu cores : 24 2025-05-07T19:43:04.6181783Z apicid : 19 2025-05-07T19:43:04.6181884Z initial apicid : 19 2025-05-07T19:43:04.6181959Z fpu : yes 2025-05-07T19:43:04.6182042Z fpu_exception : yes 2025-05-07T19:43:04.6182134Z cpuid level : 13 2025-05-07T19:43:04.6182273Z wp : yes 2025-05-07T19:43:04.6184485Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6185054Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6185143Z bogomips : 6000.01 2025-05-07T19:43:04.6185236Z clflush size : 64 2025-05-07T19:43:04.6185349Z cache_alignment : 64 2025-05-07T19:43:04.6185485Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6185581Z power management: 2025-05-07T19:43:04.6185586Z 2025-05-07T19:43:04.6185696Z processor : 58 2025-05-07T19:43:04.6185789Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6185881Z cpu family : 6 2025-05-07T19:43:04.6185965Z model : 85 2025-05-07T19:43:04.6186151Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6186242Z stepping : 7 2025-05-07T19:43:04.6186335Z microcode : 0x5003901 2025-05-07T19:43:04.6186428Z cpu MHz : 1199.376 2025-05-07T19:43:04.6186515Z cache size : 36608 KB 2025-05-07T19:43:04.6186603Z physical id : 0 2025-05-07T19:43:04.6186690Z siblings : 48 2025-05-07T19:43:04.6186790Z core id : 10 2025-05-07T19:43:04.6186878Z cpu cores : 24 2025-05-07T19:43:04.6186965Z apicid : 21 2025-05-07T19:43:04.6187051Z initial apicid : 21 2025-05-07T19:43:04.6187145Z fpu : yes 2025-05-07T19:43:04.6187234Z fpu_exception : yes 2025-05-07T19:43:04.6187397Z cpuid level : 13 2025-05-07T19:43:04.6187496Z wp : yes 2025-05-07T19:43:04.6189821Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6190228Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6190332Z bogomips : 6000.01 2025-05-07T19:43:04.6190417Z clflush size : 64 2025-05-07T19:43:04.6190506Z cache_alignment : 64 2025-05-07T19:43:04.6190663Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6190754Z power management: 2025-05-07T19:43:04.6190759Z 2025-05-07T19:43:04.6190852Z processor : 59 2025-05-07T19:43:04.6190944Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6191043Z cpu family : 6 2025-05-07T19:43:04.6191124Z model : 85 2025-05-07T19:43:04.6191287Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6191384Z stepping : 7 2025-05-07T19:43:04.6191475Z microcode : 0x5003901 2025-05-07T19:43:04.6191557Z cpu MHz : 3000.008 2025-05-07T19:43:04.6191646Z cache size : 36608 KB 2025-05-07T19:43:04.6191749Z physical id : 0 2025-05-07T19:43:04.6191830Z siblings : 48 2025-05-07T19:43:04.6191906Z core id : 11 2025-05-07T19:43:04.6192003Z cpu cores : 24 2025-05-07T19:43:04.6192087Z apicid : 23 2025-05-07T19:43:04.6192172Z initial apicid : 23 2025-05-07T19:43:04.6192252Z fpu : yes 2025-05-07T19:43:04.6192360Z fpu_exception : yes 2025-05-07T19:43:04.6192442Z cpuid level : 13 2025-05-07T19:43:04.6192521Z wp : yes 2025-05-07T19:43:04.6194789Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6195262Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6195347Z bogomips : 6000.01 2025-05-07T19:43:04.6195449Z clflush size : 64 2025-05-07T19:43:04.6195541Z cache_alignment : 64 2025-05-07T19:43:04.6195686Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6195784Z power management: 2025-05-07T19:43:04.6195792Z 2025-05-07T19:43:04.6195876Z processor : 60 2025-05-07T19:43:04.6195987Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6196082Z cpu family : 6 2025-05-07T19:43:04.6196192Z model : 85 2025-05-07T19:43:04.6196366Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6196460Z stepping : 7 2025-05-07T19:43:04.6196580Z microcode : 0x5003901 2025-05-07T19:43:04.6196670Z cpu MHz : 2387.877 2025-05-07T19:43:04.6196772Z cache size : 36608 KB 2025-05-07T19:43:04.6196864Z physical id : 0 2025-05-07T19:43:04.6196971Z siblings : 48 2025-05-07T19:43:04.6197056Z core id : 12 2025-05-07T19:43:04.6197145Z cpu cores : 24 2025-05-07T19:43:04.6197250Z apicid : 25 2025-05-07T19:43:04.6197341Z initial apicid : 25 2025-05-07T19:43:04.6197425Z fpu : yes 2025-05-07T19:43:04.6197526Z fpu_exception : yes 2025-05-07T19:43:04.6197642Z cpuid level : 13 2025-05-07T19:43:04.6197737Z wp : yes 2025-05-07T19:43:04.6200058Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6200501Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6200604Z bogomips : 6000.01 2025-05-07T19:43:04.6200703Z clflush size : 64 2025-05-07T19:43:04.6200833Z cache_alignment : 64 2025-05-07T19:43:04.6200983Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6201191Z power management: 2025-05-07T19:43:04.6201195Z 2025-05-07T19:43:04.6201317Z processor : 61 2025-05-07T19:43:04.6201425Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6201519Z cpu family : 6 2025-05-07T19:43:04.6201612Z model : 85 2025-05-07T19:43:04.6201806Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6201901Z stepping : 7 2025-05-07T19:43:04.6201993Z microcode : 0x5003901 2025-05-07T19:43:04.6202108Z cpu MHz : 1200.704 2025-05-07T19:43:04.6202204Z cache size : 36608 KB 2025-05-07T19:43:04.6202302Z physical id : 0 2025-05-07T19:43:04.6202390Z siblings : 48 2025-05-07T19:43:04.6202505Z core id : 13 2025-05-07T19:43:04.6202593Z cpu cores : 24 2025-05-07T19:43:04.6202681Z apicid : 27 2025-05-07T19:43:04.6202796Z initial apicid : 27 2025-05-07T19:43:04.6202883Z fpu : yes 2025-05-07T19:43:04.6202977Z fpu_exception : yes 2025-05-07T19:43:04.6203066Z cpuid level : 13 2025-05-07T19:43:04.6203178Z wp : yes 2025-05-07T19:43:04.6205386Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6206032Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6206713Z bogomips : 6000.01 2025-05-07T19:43:04.6206997Z clflush size : 64 2025-05-07T19:43:04.6207244Z cache_alignment : 64 2025-05-07T19:43:04.6207562Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6207913Z power management: 2025-05-07T19:43:04.6208091Z 2025-05-07T19:43:04.6208190Z processor : 62 2025-05-07T19:43:04.6208439Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6208746Z cpu family : 6 2025-05-07T19:43:04.6208966Z model : 85 2025-05-07T19:43:04.6209281Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6209675Z stepping : 7 2025-05-07T19:43:04.6209901Z microcode : 0x5003901 2025-05-07T19:43:04.6210173Z cpu MHz : 1200.373 2025-05-07T19:43:04.6210406Z cache size : 36608 KB 2025-05-07T19:43:04.6210678Z physical id : 0 2025-05-07T19:43:04.6210909Z siblings : 48 2025-05-07T19:43:04.6211145Z core id : 14 2025-05-07T19:43:04.6211378Z cpu cores : 24 2025-05-07T19:43:04.6211617Z apicid : 29 2025-05-07T19:43:04.6211845Z initial apicid : 29 2025-05-07T19:43:04.6212111Z fpu : yes 2025-05-07T19:43:04.6212337Z fpu_exception : yes 2025-05-07T19:43:04.6212600Z cpuid level : 13 2025-05-07T19:43:04.6212859Z wp : yes 2025-05-07T19:43:04.6215342Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6218434Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6219177Z bogomips : 6000.01 2025-05-07T19:43:04.6219408Z clflush size : 64 2025-05-07T19:43:04.6219659Z cache_alignment : 64 2025-05-07T19:43:04.6219946Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6220305Z power management: 2025-05-07T19:43:04.6220447Z 2025-05-07T19:43:04.6220538Z processor : 63 2025-05-07T19:43:04.6220798Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6221076Z cpu family : 6 2025-05-07T19:43:04.6221293Z model : 85 2025-05-07T19:43:04.6221598Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6221962Z stepping : 7 2025-05-07T19:43:04.6222208Z microcode : 0x5003901 2025-05-07T19:43:04.6222451Z cpu MHz : 1199.184 2025-05-07T19:43:04.6222699Z cache size : 36608 KB 2025-05-07T19:43:04.6222936Z physical id : 0 2025-05-07T19:43:04.6223177Z siblings : 48 2025-05-07T19:43:04.6223388Z core id : 15 2025-05-07T19:43:04.6223621Z cpu cores : 24 2025-05-07T19:43:04.6223833Z apicid : 31 2025-05-07T19:43:04.6224077Z initial apicid : 31 2025-05-07T19:43:04.6224303Z fpu : yes 2025-05-07T19:43:04.6224572Z fpu_exception : yes 2025-05-07T19:43:04.6224834Z cpuid level : 13 2025-05-07T19:43:04.6225050Z wp : yes 2025-05-07T19:43:04.6227396Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6230431Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6231058Z bogomips : 6000.01 2025-05-07T19:43:04.6231324Z clflush size : 64 2025-05-07T19:43:04.6231562Z cache_alignment : 64 2025-05-07T19:43:04.6231883Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6232234Z power management: 2025-05-07T19:43:04.6232404Z 2025-05-07T19:43:04.6232501Z processor : 64 2025-05-07T19:43:04.6232742Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6233034Z cpu family : 6 2025-05-07T19:43:04.6233279Z model : 85 2025-05-07T19:43:04.6233571Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6233978Z stepping : 7 2025-05-07T19:43:04.6234203Z microcode : 0x5003901 2025-05-07T19:43:04.6234477Z cpu MHz : 1199.925 2025-05-07T19:43:04.6234713Z cache size : 36608 KB 2025-05-07T19:43:04.6234990Z physical id : 0 2025-05-07T19:43:04.6235219Z siblings : 48 2025-05-07T19:43:04.6235468Z core id : 16 2025-05-07T19:43:04.6235693Z cpu cores : 24 2025-05-07T19:43:04.6235954Z apicid : 33 2025-05-07T19:43:04.6236182Z initial apicid : 33 2025-05-07T19:43:04.6236449Z fpu : yes 2025-05-07T19:43:04.6236705Z fpu_exception : yes 2025-05-07T19:43:04.6236945Z cpuid level : 13 2025-05-07T19:43:04.6237199Z wp : yes 2025-05-07T19:43:04.6239662Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6242551Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6243196Z bogomips : 6000.01 2025-05-07T19:43:04.6243438Z clflush size : 64 2025-05-07T19:43:04.6243700Z cache_alignment : 64 2025-05-07T19:43:04.6243987Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6244357Z power management: 2025-05-07T19:43:04.6244498Z 2025-05-07T19:43:04.6244591Z processor : 65 2025-05-07T19:43:04.6244851Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6245130Z cpu family : 6 2025-05-07T19:43:04.6245346Z model : 85 2025-05-07T19:43:04.6245657Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6246023Z stepping : 7 2025-05-07T19:43:04.6246260Z microcode : 0x5003901 2025-05-07T19:43:04.6246494Z cpu MHz : 3000.008 2025-05-07T19:43:04.6246746Z cache size : 36608 KB 2025-05-07T19:43:04.6246978Z physical id : 0 2025-05-07T19:43:04.6247222Z siblings : 48 2025-05-07T19:43:04.6247434Z core id : 17 2025-05-07T19:43:04.6247658Z cpu cores : 24 2025-05-07T19:43:04.6247875Z apicid : 35 2025-05-07T19:43:04.6248113Z initial apicid : 35 2025-05-07T19:43:04.6263353Z fpu : yes 2025-05-07T19:43:04.6263623Z fpu_exception : yes 2025-05-07T19:43:04.6263839Z cpuid level : 13 2025-05-07T19:43:04.6264059Z wp : yes 2025-05-07T19:43:04.6266277Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6268972Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6269823Z bogomips : 6000.01 2025-05-07T19:43:04.6270049Z clflush size : 64 2025-05-07T19:43:04.6270277Z cache_alignment : 64 2025-05-07T19:43:04.6270624Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6270972Z power management: 2025-05-07T19:43:04.6271111Z 2025-05-07T19:43:04.6271201Z processor : 66 2025-05-07T19:43:04.6271438Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6271698Z cpu family : 6 2025-05-07T19:43:04.6271903Z model : 85 2025-05-07T19:43:04.6272191Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6272550Z stepping : 7 2025-05-07T19:43:04.6272766Z microcode : 0x5003901 2025-05-07T19:43:04.6273000Z cpu MHz : 3000.008 2025-05-07T19:43:04.6273224Z cache size : 36608 KB 2025-05-07T19:43:04.6273453Z physical id : 0 2025-05-07T19:43:04.6273677Z siblings : 48 2025-05-07T19:43:04.6273879Z core id : 18 2025-05-07T19:43:04.6274090Z cpu cores : 24 2025-05-07T19:43:04.6274289Z apicid : 37 2025-05-07T19:43:04.6274501Z initial apicid : 37 2025-05-07T19:43:04.6274739Z fpu : yes 2025-05-07T19:43:04.6274938Z fpu_exception : yes 2025-05-07T19:43:04.6275165Z cpuid level : 13 2025-05-07T19:43:04.6275367Z wp : yes 2025-05-07T19:43:04.6277815Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6280576Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6281173Z bogomips : 6000.01 2025-05-07T19:43:04.6281393Z clflush size : 64 2025-05-07T19:43:04.6281607Z cache_alignment : 64 2025-05-07T19:43:04.6281997Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6282316Z power management: 2025-05-07T19:43:04.6282458Z 2025-05-07T19:43:04.6282538Z processor : 67 2025-05-07T19:43:04.6282761Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6283000Z cpu family : 6 2025-05-07T19:43:04.6283207Z model : 85 2025-05-07T19:43:04.6283481Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6283841Z stepping : 7 2025-05-07T19:43:04.6284045Z microcode : 0x5003901 2025-05-07T19:43:04.6284276Z cpu MHz : 1200.336 2025-05-07T19:43:04.6284651Z cache size : 36608 KB 2025-05-07T19:43:04.6285048Z physical id : 0 2025-05-07T19:43:04.6285253Z siblings : 48 2025-05-07T19:43:04.6285494Z core id : 19 2025-05-07T19:43:04.6285694Z cpu cores : 24 2025-05-07T19:43:04.6285903Z apicid : 39 2025-05-07T19:43:04.6286115Z initial apicid : 39 2025-05-07T19:43:04.6286345Z fpu : yes 2025-05-07T19:43:04.6286552Z fpu_exception : yes 2025-05-07T19:43:04.6286763Z cpuid level : 13 2025-05-07T19:43:04.6286973Z wp : yes 2025-05-07T19:43:04.6289336Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6292187Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6292793Z bogomips : 6000.01 2025-05-07T19:43:04.6293003Z clflush size : 64 2025-05-07T19:43:04.6293222Z cache_alignment : 64 2025-05-07T19:43:04.6293490Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6293820Z power management: 2025-05-07T19:43:04.6293959Z 2025-05-07T19:43:04.6294041Z processor : 68 2025-05-07T19:43:04.6294260Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6294502Z cpu family : 6 2025-05-07T19:43:04.6294698Z model : 85 2025-05-07T19:43:04.6294972Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6295320Z stepping : 7 2025-05-07T19:43:04.6295531Z microcode : 0x5003901 2025-05-07T19:43:04.6295746Z cpu MHz : 1199.537 2025-05-07T19:43:04.6295967Z cache size : 36608 KB 2025-05-07T19:43:04.6296183Z physical id : 0 2025-05-07T19:43:04.6296388Z siblings : 48 2025-05-07T19:43:04.6296579Z core id : 20 2025-05-07T19:43:04.6296780Z cpu cores : 24 2025-05-07T19:43:04.6296973Z apicid : 41 2025-05-07T19:43:04.6297174Z initial apicid : 41 2025-05-07T19:43:04.6297393Z fpu : yes 2025-05-07T19:43:04.6297582Z fpu_exception : yes 2025-05-07T19:43:04.6297912Z cpuid level : 13 2025-05-07T19:43:04.6298102Z wp : yes 2025-05-07T19:43:04.6300492Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6303215Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6303767Z bogomips : 6000.01 2025-05-07T19:43:04.6303973Z clflush size : 64 2025-05-07T19:43:04.6304163Z cache_alignment : 64 2025-05-07T19:43:04.6304416Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6304715Z power management: 2025-05-07T19:43:04.6304843Z 2025-05-07T19:43:04.6304917Z processor : 69 2025-05-07T19:43:04.6305118Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6305333Z cpu family : 6 2025-05-07T19:43:04.6305525Z model : 85 2025-05-07T19:43:04.6305778Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6306108Z stepping : 7 2025-05-07T19:43:04.6306293Z microcode : 0x5003901 2025-05-07T19:43:04.6306507Z cpu MHz : 3000.008 2025-05-07T19:43:04.6306702Z cache size : 36608 KB 2025-05-07T19:43:04.6306915Z physical id : 0 2025-05-07T19:43:04.6307106Z siblings : 48 2025-05-07T19:43:04.6307292Z core id : 21 2025-05-07T19:43:04.6307468Z cpu cores : 24 2025-05-07T19:43:04.6307656Z apicid : 43 2025-05-07T19:43:04.6307849Z initial apicid : 43 2025-05-07T19:43:04.6308041Z fpu : yes 2025-05-07T19:43:04.6308224Z fpu_exception : yes 2025-05-07T19:43:04.6308419Z cpuid level : 13 2025-05-07T19:43:04.6308615Z wp : yes 2025-05-07T19:43:04.6311142Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6313956Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6314557Z bogomips : 6000.01 2025-05-07T19:43:04.6314768Z clflush size : 64 2025-05-07T19:43:04.6314990Z cache_alignment : 64 2025-05-07T19:43:04.6315259Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6315602Z power management: 2025-05-07T19:43:04.6315734Z 2025-05-07T19:43:04.6315816Z processor : 70 2025-05-07T19:43:04.6316036Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6316278Z cpu family : 6 2025-05-07T19:43:04.6316469Z model : 85 2025-05-07T19:43:04.6316747Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6317092Z stepping : 7 2025-05-07T19:43:04.6317293Z microcode : 0x5003901 2025-05-07T19:43:04.6317514Z cpu MHz : 1199.479 2025-05-07T19:43:04.6317727Z cache size : 36608 KB 2025-05-07T19:43:04.6317938Z physical id : 0 2025-05-07T19:43:04.6318147Z siblings : 48 2025-05-07T19:43:04.6318336Z core id : 22 2025-05-07T19:43:04.6318533Z cpu cores : 24 2025-05-07T19:43:04.6318725Z apicid : 45 2025-05-07T19:43:04.6318933Z initial apicid : 45 2025-05-07T19:43:04.6319145Z fpu : yes 2025-05-07T19:43:04.6319334Z fpu_exception : yes 2025-05-07T19:43:04.6319545Z cpuid level : 13 2025-05-07T19:43:04.6319743Z wp : yes 2025-05-07T19:43:04.6322250Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6324787Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6325346Z bogomips : 6000.01 2025-05-07T19:43:04.6325548Z clflush size : 64 2025-05-07T19:43:04.6325748Z cache_alignment : 64 2025-05-07T19:43:04.6326004Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6326304Z power management: 2025-05-07T19:43:04.6326437Z 2025-05-07T19:43:04.6326522Z processor : 71 2025-05-07T19:43:04.6326728Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6326949Z cpu family : 6 2025-05-07T19:43:04.6327137Z model : 85 2025-05-07T19:43:04.6327389Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6327723Z stepping : 7 2025-05-07T19:43:04.6327910Z microcode : 0x5003901 2025-05-07T19:43:04.6328117Z cpu MHz : 3000.008 2025-05-07T19:43:04.6328309Z cache size : 36608 KB 2025-05-07T19:43:04.6328514Z physical id : 0 2025-05-07T19:43:04.6328704Z siblings : 48 2025-05-07T19:43:04.6328899Z core id : 23 2025-05-07T19:43:04.6329083Z cpu cores : 24 2025-05-07T19:43:04.6329272Z apicid : 47 2025-05-07T19:43:04.6329457Z initial apicid : 47 2025-05-07T19:43:04.6329657Z fpu : yes 2025-05-07T19:43:04.6329851Z fpu_exception : yes 2025-05-07T19:43:04.6330054Z cpuid level : 13 2025-05-07T19:43:04.6330252Z wp : yes 2025-05-07T19:43:04.6332441Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6334982Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6335634Z bogomips : 6000.01 2025-05-07T19:43:04.6335831Z clflush size : 64 2025-05-07T19:43:04.6336044Z cache_alignment : 64 2025-05-07T19:43:04.6336310Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6336615Z power management: 2025-05-07T19:43:04.6336740Z 2025-05-07T19:43:04.6336827Z processor : 72 2025-05-07T19:43:04.6337032Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6337259Z cpu family : 6 2025-05-07T19:43:04.6337448Z model : 85 2025-05-07T19:43:04.6337717Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6338043Z stepping : 7 2025-05-07T19:43:04.6338248Z microcode : 0x5003901 2025-05-07T19:43:04.6338460Z cpu MHz : 3000.008 2025-05-07T19:43:04.6338672Z cache size : 36608 KB 2025-05-07T19:43:04.6338892Z physical id : 1 2025-05-07T19:43:04.6339092Z siblings : 48 2025-05-07T19:43:04.6339303Z core id : 0 2025-05-07T19:43:04.6339485Z cpu cores : 24 2025-05-07T19:43:04.6339679Z apicid : 65 2025-05-07T19:43:04.6339856Z initial apicid : 65 2025-05-07T19:43:04.6340065Z fpu : yes 2025-05-07T19:43:04.6340240Z fpu_exception : yes 2025-05-07T19:43:04.6340448Z cpuid level : 13 2025-05-07T19:43:04.6340634Z wp : yes 2025-05-07T19:43:04.6342844Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6345450Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6346020Z bogomips : 6000.01 2025-05-07T19:43:04.6346225Z clflush size : 64 2025-05-07T19:43:04.6346442Z cache_alignment : 64 2025-05-07T19:43:04.6346692Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6347009Z power management: 2025-05-07T19:43:04.6347132Z 2025-05-07T19:43:04.6347210Z processor : 73 2025-05-07T19:43:04.6347427Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6347649Z cpu family : 6 2025-05-07T19:43:04.6347855Z model : 85 2025-05-07T19:43:04.6348113Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6348452Z stepping : 7 2025-05-07T19:43:04.6348646Z microcode : 0x5003901 2025-05-07T19:43:04.6348864Z cpu MHz : 3210.051 2025-05-07T19:43:04.6349123Z cache size : 36608 KB 2025-05-07T19:43:04.6349362Z physical id : 1 2025-05-07T19:43:04.6349742Z siblings : 48 2025-05-07T19:43:04.6349951Z core id : 1 2025-05-07T19:43:04.6350188Z cpu cores : 24 2025-05-07T19:43:04.6350405Z apicid : 67 2025-05-07T19:43:04.6350663Z initial apicid : 67 2025-05-07T19:43:04.6350873Z fpu : yes 2025-05-07T19:43:04.6351087Z fpu_exception : yes 2025-05-07T19:43:04.6351313Z cpuid level : 13 2025-05-07T19:43:04.6351541Z wp : yes 2025-05-07T19:43:04.6353931Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6356679Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6357351Z bogomips : 6000.01 2025-05-07T19:43:04.6357570Z clflush size : 64 2025-05-07T19:43:04.6357814Z cache_alignment : 64 2025-05-07T19:43:04.6358102Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6358428Z power management: 2025-05-07T19:43:04.6358564Z 2025-05-07T19:43:04.6358654Z processor : 74 2025-05-07T19:43:04.6358872Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6359127Z cpu family : 6 2025-05-07T19:43:04.6359330Z model : 85 2025-05-07T19:43:04.6359612Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6359970Z stepping : 7 2025-05-07T19:43:04.6360187Z microcode : 0x5003901 2025-05-07T19:43:04.6360408Z cpu MHz : 3000.008 2025-05-07T19:43:04.6360627Z cache size : 36608 KB 2025-05-07T19:43:04.6360861Z physical id : 1 2025-05-07T19:43:04.6361069Z siblings : 48 2025-05-07T19:43:04.6361279Z core id : 2 2025-05-07T19:43:04.6361481Z cpu cores : 24 2025-05-07T19:43:04.6361691Z apicid : 69 2025-05-07T19:43:04.6361983Z initial apicid : 69 2025-05-07T19:43:04.6362188Z fpu : yes 2025-05-07T19:43:04.6362370Z fpu_exception : yes 2025-05-07T19:43:04.6362585Z cpuid level : 13 2025-05-07T19:43:04.6362776Z wp : yes 2025-05-07T19:43:04.6364979Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6367585Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6368142Z bogomips : 6000.01 2025-05-07T19:43:04.6368368Z clflush size : 64 2025-05-07T19:43:04.6368584Z cache_alignment : 64 2025-05-07T19:43:04.6368828Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6369144Z power management: 2025-05-07T19:43:04.6369267Z 2025-05-07T19:43:04.6369342Z processor : 75 2025-05-07T19:43:04.6369544Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6369756Z cpu family : 6 2025-05-07T19:43:04.6369945Z model : 85 2025-05-07T19:43:04.6370194Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6370520Z stepping : 7 2025-05-07T19:43:04.6370701Z microcode : 0x5003901 2025-05-07T19:43:04.6370908Z cpu MHz : 3131.522 2025-05-07T19:43:04.6371102Z cache size : 36608 KB 2025-05-07T19:43:04.6371335Z physical id : 1 2025-05-07T19:43:04.6371536Z siblings : 48 2025-05-07T19:43:04.6371731Z core id : 3 2025-05-07T19:43:04.6371917Z cpu cores : 24 2025-05-07T19:43:04.6372107Z apicid : 71 2025-05-07T19:43:04.6372301Z initial apicid : 71 2025-05-07T19:43:04.6372499Z fpu : yes 2025-05-07T19:43:04.6372693Z fpu_exception : yes 2025-05-07T19:43:04.6372895Z cpuid level : 13 2025-05-07T19:43:04.6373103Z wp : yes 2025-05-07T19:43:04.6375293Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6377828Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6378397Z bogomips : 6000.01 2025-05-07T19:43:04.6378597Z clflush size : 64 2025-05-07T19:43:04.6378844Z cache_alignment : 64 2025-05-07T19:43:04.6379103Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6379406Z power management: 2025-05-07T19:43:04.6379528Z 2025-05-07T19:43:04.6379612Z processor : 76 2025-05-07T19:43:04.6379806Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6380021Z cpu family : 6 2025-05-07T19:43:04.6380201Z model : 85 2025-05-07T19:43:04.6380457Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6380779Z stepping : 7 2025-05-07T19:43:04.6380974Z microcode : 0x5003901 2025-05-07T19:43:04.6381177Z cpu MHz : 3225.389 2025-05-07T19:43:04.6381379Z cache size : 36608 KB 2025-05-07T19:43:04.6381594Z physical id : 1 2025-05-07T19:43:04.6381779Z siblings : 48 2025-05-07T19:43:04.6381966Z core id : 4 2025-05-07T19:43:04.6382137Z cpu cores : 24 2025-05-07T19:43:04.6382320Z apicid : 73 2025-05-07T19:43:04.6382497Z initial apicid : 73 2025-05-07T19:43:04.6382697Z fpu : yes 2025-05-07T19:43:04.6382872Z fpu_exception : yes 2025-05-07T19:43:04.6383071Z cpuid level : 13 2025-05-07T19:43:04.6383249Z wp : yes 2025-05-07T19:43:04.6385826Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6388575Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6389237Z bogomips : 6000.01 2025-05-07T19:43:04.6389597Z clflush size : 64 2025-05-07T19:43:04.6389864Z cache_alignment : 64 2025-05-07T19:43:04.6390164Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6390549Z power management: 2025-05-07T19:43:04.6390696Z 2025-05-07T19:43:04.6390796Z processor : 77 2025-05-07T19:43:04.6391060Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6391324Z cpu family : 6 2025-05-07T19:43:04.6391568Z model : 85 2025-05-07T19:43:04.6391863Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6392261Z stepping : 7 2025-05-07T19:43:04.6392486Z microcode : 0x5003901 2025-05-07T19:43:04.6392759Z cpu MHz : 3176.906 2025-05-07T19:43:04.6392995Z cache size : 36608 KB 2025-05-07T19:43:04.6393268Z physical id : 1 2025-05-07T19:43:04.6393512Z siblings : 48 2025-05-07T19:43:04.6393734Z core id : 5 2025-05-07T19:43:04.6393970Z cpu cores : 24 2025-05-07T19:43:04.6394196Z apicid : 75 2025-05-07T19:43:04.6394438Z initial apicid : 75 2025-05-07T19:43:04.6394673Z fpu : yes 2025-05-07T19:43:04.6394906Z fpu_exception : yes 2025-05-07T19:43:04.6395135Z cpuid level : 13 2025-05-07T19:43:04.6395374Z wp : yes 2025-05-07T19:43:04.6397781Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6400525Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6401246Z bogomips : 6000.01 2025-05-07T19:43:04.6401455Z clflush size : 64 2025-05-07T19:43:04.6401686Z cache_alignment : 64 2025-05-07T19:43:04.6401955Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6402340Z power management: 2025-05-07T19:43:04.6402462Z 2025-05-07T19:43:04.6402566Z processor : 78 2025-05-07T19:43:04.6402785Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6403040Z cpu family : 6 2025-05-07T19:43:04.6403232Z model : 85 2025-05-07T19:43:04.6403510Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6403835Z stepping : 7 2025-05-07T19:43:04.6404045Z microcode : 0x5003901 2025-05-07T19:43:04.6404259Z cpu MHz : 3307.460 2025-05-07T19:43:04.6404469Z cache size : 36608 KB 2025-05-07T19:43:04.6404674Z physical id : 1 2025-05-07T19:43:04.6404855Z siblings : 48 2025-05-07T19:43:04.6405054Z core id : 6 2025-05-07T19:43:04.6405230Z cpu cores : 24 2025-05-07T19:43:04.6405422Z apicid : 77 2025-05-07T19:43:04.6405610Z initial apicid : 77 2025-05-07T19:43:04.6405815Z fpu : yes 2025-05-07T19:43:04.6405996Z fpu_exception : yes 2025-05-07T19:43:04.6406188Z cpuid level : 13 2025-05-07T19:43:04.6406387Z wp : yes 2025-05-07T19:43:04.6408597Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6411154Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6411709Z bogomips : 6000.01 2025-05-07T19:43:04.6411914Z clflush size : 64 2025-05-07T19:43:04.6412141Z cache_alignment : 64 2025-05-07T19:43:04.6412443Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6412757Z power management: 2025-05-07T19:43:04.6412892Z 2025-05-07T19:43:04.6412973Z processor : 79 2025-05-07T19:43:04.6413189Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6413402Z cpu family : 6 2025-05-07T19:43:04.6413604Z model : 85 2025-05-07T19:43:04.6413855Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6414188Z stepping : 7 2025-05-07T19:43:04.6414372Z microcode : 0x5003901 2025-05-07T19:43:04.6414574Z cpu MHz : 3152.536 2025-05-07T19:43:04.6414768Z cache size : 36608 KB 2025-05-07T19:43:04.6414966Z physical id : 1 2025-05-07T19:43:04.6415147Z siblings : 48 2025-05-07T19:43:04.6415336Z core id : 7 2025-05-07T19:43:04.6415526Z cpu cores : 24 2025-05-07T19:43:04.6415699Z apicid : 79 2025-05-07T19:43:04.6415888Z initial apicid : 79 2025-05-07T19:43:04.6416069Z fpu : yes 2025-05-07T19:43:04.6416255Z fpu_exception : yes 2025-05-07T19:43:04.6416445Z cpuid level : 13 2025-05-07T19:43:04.6416634Z wp : yes 2025-05-07T19:43:04.6418825Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6421351Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6421899Z bogomips : 6000.01 2025-05-07T19:43:04.6422091Z clflush size : 64 2025-05-07T19:43:04.6422292Z cache_alignment : 64 2025-05-07T19:43:04.6422545Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6422838Z power management: 2025-05-07T19:43:04.6422955Z 2025-05-07T19:43:04.6423772Z processor : 80 2025-05-07T19:43:04.6423959Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6424180Z cpu family : 6 2025-05-07T19:43:04.6424357Z model : 85 2025-05-07T19:43:04.6424607Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6424930Z stepping : 7 2025-05-07T19:43:04.6425113Z microcode : 0x5003901 2025-05-07T19:43:04.6425315Z cpu MHz : 3150.238 2025-05-07T19:43:04.6425510Z cache size : 36608 KB 2025-05-07T19:43:04.6425716Z physical id : 1 2025-05-07T19:43:04.6425898Z siblings : 48 2025-05-07T19:43:04.6426090Z core id : 8 2025-05-07T19:43:04.6426261Z cpu cores : 24 2025-05-07T19:43:04.6426445Z apicid : 81 2025-05-07T19:43:04.6426634Z initial apicid : 81 2025-05-07T19:43:04.6426828Z fpu : yes 2025-05-07T19:43:04.6427002Z fpu_exception : yes 2025-05-07T19:43:04.6427196Z cpuid level : 13 2025-05-07T19:43:04.6427373Z wp : yes 2025-05-07T19:43:04.6429813Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6432574Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6433164Z bogomips : 6000.01 2025-05-07T19:43:04.6433375Z clflush size : 64 2025-05-07T19:43:04.6433583Z cache_alignment : 64 2025-05-07T19:43:04.6433855Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6434190Z power management: 2025-05-07T19:43:04.6434390Z 2025-05-07T19:43:04.6434473Z processor : 81 2025-05-07T19:43:04.6434691Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6434774Z cpu family : 6 2025-05-07T19:43:04.6434849Z model : 85 2025-05-07T19:43:04.6435021Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6435101Z stepping : 7 2025-05-07T19:43:04.6435192Z microcode : 0x5003901 2025-05-07T19:43:04.6435273Z cpu MHz : 3000.008 2025-05-07T19:43:04.6435369Z cache size : 36608 KB 2025-05-07T19:43:04.6435451Z physical id : 1 2025-05-07T19:43:04.6435535Z siblings : 48 2025-05-07T19:43:04.6435621Z core id : 9 2025-05-07T19:43:04.6435702Z cpu cores : 24 2025-05-07T19:43:04.6435789Z apicid : 83 2025-05-07T19:43:04.6435873Z initial apicid : 83 2025-05-07T19:43:04.6435963Z fpu : yes 2025-05-07T19:43:04.6436049Z fpu_exception : yes 2025-05-07T19:43:04.6436129Z cpuid level : 13 2025-05-07T19:43:04.6436213Z wp : yes 2025-05-07T19:43:04.6438478Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6438885Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6438979Z bogomips : 6000.01 2025-05-07T19:43:04.6439062Z clflush size : 64 2025-05-07T19:43:04.6439143Z cache_alignment : 64 2025-05-07T19:43:04.6439283Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6439365Z power management: 2025-05-07T19:43:04.6439370Z 2025-05-07T19:43:04.6439454Z processor : 82 2025-05-07T19:43:04.6439539Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6439676Z cpu family : 6 2025-05-07T19:43:04.6439749Z model : 85 2025-05-07T19:43:04.6439912Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6439997Z stepping : 7 2025-05-07T19:43:04.6440080Z microcode : 0x5003901 2025-05-07T19:43:04.6440161Z cpu MHz : 3154.389 2025-05-07T19:43:04.6440242Z cache size : 36608 KB 2025-05-07T19:43:04.6440330Z physical id : 1 2025-05-07T19:43:04.6440405Z siblings : 48 2025-05-07T19:43:04.6440478Z core id : 10 2025-05-07T19:43:04.6440568Z cpu cores : 24 2025-05-07T19:43:04.6440650Z apicid : 85 2025-05-07T19:43:04.6440731Z initial apicid : 85 2025-05-07T19:43:04.6440805Z fpu : yes 2025-05-07T19:43:04.6440898Z fpu_exception : yes 2025-05-07T19:43:04.6440979Z cpuid level : 13 2025-05-07T19:43:04.6441050Z wp : yes 2025-05-07T19:43:04.6443324Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6443692Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6443766Z bogomips : 6000.01 2025-05-07T19:43:04.6443848Z clflush size : 64 2025-05-07T19:43:04.6443928Z cache_alignment : 64 2025-05-07T19:43:04.6444051Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6444135Z power management: 2025-05-07T19:43:04.6444139Z 2025-05-07T19:43:04.6444214Z processor : 83 2025-05-07T19:43:04.6444340Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6444410Z cpu family : 6 2025-05-07T19:43:04.6444490Z model : 85 2025-05-07T19:43:04.6444637Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6444710Z stepping : 7 2025-05-07T19:43:04.6444795Z microcode : 0x5003901 2025-05-07T19:43:04.6444876Z cpu MHz : 3170.572 2025-05-07T19:43:04.6444949Z cache size : 36608 KB 2025-05-07T19:43:04.6445022Z physical id : 1 2025-05-07T19:43:04.6445099Z siblings : 48 2025-05-07T19:43:04.6445171Z core id : 11 2025-05-07T19:43:04.6445239Z cpu cores : 24 2025-05-07T19:43:04.6445308Z apicid : 87 2025-05-07T19:43:04.6445396Z initial apicid : 87 2025-05-07T19:43:04.6445464Z fpu : yes 2025-05-07T19:43:04.6445546Z fpu_exception : yes 2025-05-07T19:43:04.6445631Z cpuid level : 13 2025-05-07T19:43:04.6445701Z wp : yes 2025-05-07T19:43:04.6447784Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6448159Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6448238Z bogomips : 6000.01 2025-05-07T19:43:04.6448311Z clflush size : 64 2025-05-07T19:43:04.6448403Z cache_alignment : 64 2025-05-07T19:43:04.6448523Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6448598Z power management: 2025-05-07T19:43:04.6448602Z 2025-05-07T19:43:04.6448688Z processor : 84 2025-05-07T19:43:04.6448777Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6448855Z cpu family : 6 2025-05-07T19:43:04.6448924Z model : 85 2025-05-07T19:43:04.6449080Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6449200Z stepping : 7 2025-05-07T19:43:04.6449279Z microcode : 0x5003901 2025-05-07T19:43:04.6449361Z cpu MHz : 3203.897 2025-05-07T19:43:04.6449445Z cache size : 36608 KB 2025-05-07T19:43:04.6449517Z physical id : 1 2025-05-07T19:43:04.6449589Z siblings : 48 2025-05-07T19:43:04.6449667Z core id : 12 2025-05-07T19:43:04.6449739Z cpu cores : 24 2025-05-07T19:43:04.6449808Z apicid : 89 2025-05-07T19:43:04.6449887Z initial apicid : 89 2025-05-07T19:43:04.6449964Z fpu : yes 2025-05-07T19:43:04.6450038Z fpu_exception : yes 2025-05-07T19:43:04.6450109Z cpuid level : 13 2025-05-07T19:43:04.6450185Z wp : yes 2025-05-07T19:43:04.6452272Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6452646Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6452732Z bogomips : 6000.01 2025-05-07T19:43:04.6452808Z clflush size : 64 2025-05-07T19:43:04.6452883Z cache_alignment : 64 2025-05-07T19:43:04.6453005Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6453080Z power management: 2025-05-07T19:43:04.6453084Z 2025-05-07T19:43:04.6453156Z processor : 85 2025-05-07T19:43:04.6453246Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6453319Z cpu family : 6 2025-05-07T19:43:04.6453389Z model : 85 2025-05-07T19:43:04.6453581Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6453668Z stepping : 7 2025-05-07T19:43:04.6453747Z microcode : 0x5003901 2025-05-07T19:43:04.6453822Z cpu MHz : 3152.110 2025-05-07T19:43:04.6453897Z cache size : 36608 KB 2025-05-07T19:43:04.6453982Z physical id : 1 2025-05-07T19:43:04.6454054Z siblings : 48 2025-05-07T19:43:04.6454125Z core id : 13 2025-05-07T19:43:04.6454206Z cpu cores : 24 2025-05-07T19:43:04.6454276Z apicid : 91 2025-05-07T19:43:04.6454352Z initial apicid : 91 2025-05-07T19:43:04.6454422Z fpu : yes 2025-05-07T19:43:04.6454511Z fpu_exception : yes 2025-05-07T19:43:04.6454582Z cpuid level : 13 2025-05-07T19:43:04.6454652Z wp : yes 2025-05-07T19:43:04.6456751Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6457123Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6457199Z bogomips : 6000.01 2025-05-07T19:43:04.6457284Z clflush size : 64 2025-05-07T19:43:04.6457369Z cache_alignment : 64 2025-05-07T19:43:04.6457486Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6457573Z power management: 2025-05-07T19:43:04.6457577Z 2025-05-07T19:43:04.6457653Z processor : 86 2025-05-07T19:43:04.6457736Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6457810Z cpu family : 6 2025-05-07T19:43:04.6457887Z model : 85 2025-05-07T19:43:04.6458041Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6458116Z stepping : 7 2025-05-07T19:43:04.6458279Z microcode : 0x5003901 2025-05-07T19:43:04.6458357Z cpu MHz : 3071.180 2025-05-07T19:43:04.6458436Z cache size : 36608 KB 2025-05-07T19:43:04.6458512Z physical id : 1 2025-05-07T19:43:04.6458594Z siblings : 48 2025-05-07T19:43:04.6458668Z core id : 14 2025-05-07T19:43:04.6458740Z cpu cores : 24 2025-05-07T19:43:04.6458820Z apicid : 93 2025-05-07T19:43:04.6458893Z initial apicid : 93 2025-05-07T19:43:04.6458963Z fpu : yes 2025-05-07T19:43:04.6459046Z fpu_exception : yes 2025-05-07T19:43:04.6459129Z cpuid level : 13 2025-05-07T19:43:04.6459198Z wp : yes 2025-05-07T19:43:04.6461278Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6461659Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6461738Z bogomips : 6000.01 2025-05-07T19:43:04.6461813Z clflush size : 64 2025-05-07T19:43:04.6461899Z cache_alignment : 64 2025-05-07T19:43:04.6462019Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6462096Z power management: 2025-05-07T19:43:04.6462099Z 2025-05-07T19:43:04.6462185Z processor : 87 2025-05-07T19:43:04.6462271Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6462340Z cpu family : 6 2025-05-07T19:43:04.6462408Z model : 85 2025-05-07T19:43:04.6462558Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6462677Z stepping : 7 2025-05-07T19:43:04.6462755Z microcode : 0x5003901 2025-05-07T19:43:04.6462836Z cpu MHz : 3104.129 2025-05-07T19:43:04.6462916Z cache size : 36608 KB 2025-05-07T19:43:04.6462987Z physical id : 1 2025-05-07T19:43:04.6463057Z siblings : 48 2025-05-07T19:43:04.6463133Z core id : 15 2025-05-07T19:43:04.6463202Z cpu cores : 24 2025-05-07T19:43:04.6463270Z apicid : 95 2025-05-07T19:43:04.6463361Z initial apicid : 95 2025-05-07T19:43:04.6463428Z fpu : yes 2025-05-07T19:43:04.6463503Z fpu_exception : yes 2025-05-07T19:43:04.6463578Z cpuid level : 13 2025-05-07T19:43:04.6463661Z wp : yes 2025-05-07T19:43:04.6465739Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6466115Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6466190Z bogomips : 6000.01 2025-05-07T19:43:04.6466264Z clflush size : 64 2025-05-07T19:43:04.6466340Z cache_alignment : 64 2025-05-07T19:43:04.6466464Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6466540Z power management: 2025-05-07T19:43:04.6466544Z 2025-05-07T19:43:04.6466617Z processor : 88 2025-05-07T19:43:04.6466711Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6466783Z cpu family : 6 2025-05-07T19:43:04.6466854Z model : 85 2025-05-07T19:43:04.6467000Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6467078Z stepping : 7 2025-05-07T19:43:04.6467161Z microcode : 0x5003901 2025-05-07T19:43:04.6467232Z cpu MHz : 3217.979 2025-05-07T19:43:04.6467373Z cache size : 36608 KB 2025-05-07T19:43:04.6467452Z physical id : 1 2025-05-07T19:43:04.6467520Z siblings : 48 2025-05-07T19:43:04.6467588Z core id : 16 2025-05-07T19:43:04.6467667Z cpu cores : 24 2025-05-07T19:43:04.6467740Z apicid : 97 2025-05-07T19:43:04.6467817Z initial apicid : 97 2025-05-07T19:43:04.6467899Z fpu : yes 2025-05-07T19:43:04.6467980Z fpu_exception : yes 2025-05-07T19:43:04.6468057Z cpuid level : 13 2025-05-07T19:43:04.6468124Z wp : yes 2025-05-07T19:43:04.6470534Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6470937Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6471029Z bogomips : 6000.01 2025-05-07T19:43:04.6471112Z clflush size : 64 2025-05-07T19:43:04.6471202Z cache_alignment : 64 2025-05-07T19:43:04.6471334Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6471430Z power management: 2025-05-07T19:43:04.6471434Z 2025-05-07T19:43:04.6471518Z processor : 89 2025-05-07T19:43:04.6471609Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6471704Z cpu family : 6 2025-05-07T19:43:04.6471782Z model : 85 2025-05-07T19:43:04.6471946Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6472025Z stepping : 7 2025-05-07T19:43:04.6472126Z microcode : 0x5003901 2025-05-07T19:43:04.6472260Z cpu MHz : 3162.173 2025-05-07T19:43:04.6472341Z cache size : 36608 KB 2025-05-07T19:43:04.6472438Z physical id : 1 2025-05-07T19:43:04.6472520Z siblings : 48 2025-05-07T19:43:04.6472595Z core id : 17 2025-05-07T19:43:04.6472672Z cpu cores : 24 2025-05-07T19:43:04.6472756Z apicid : 99 2025-05-07T19:43:04.6472838Z initial apicid : 99 2025-05-07T19:43:04.6472913Z fpu : yes 2025-05-07T19:43:04.6473000Z fpu_exception : yes 2025-05-07T19:43:04.6473097Z cpuid level : 13 2025-05-07T19:43:04.6473173Z wp : yes 2025-05-07T19:43:04.6475437Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6475850Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6475936Z bogomips : 6000.01 2025-05-07T19:43:04.6476025Z clflush size : 64 2025-05-07T19:43:04.6476116Z cache_alignment : 64 2025-05-07T19:43:04.6476245Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6476328Z power management: 2025-05-07T19:43:04.6476332Z 2025-05-07T19:43:04.6476428Z processor : 90 2025-05-07T19:43:04.6476516Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6476592Z cpu family : 6 2025-05-07T19:43:04.6476678Z model : 85 2025-05-07T19:43:04.6476833Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6476912Z stepping : 7 2025-05-07T19:43:04.6476990Z microcode : 0x5003901 2025-05-07T19:43:04.6477082Z cpu MHz : 3000.008 2025-05-07T19:43:04.6477164Z cache size : 36608 KB 2025-05-07T19:43:04.6477242Z physical id : 1 2025-05-07T19:43:04.6477377Z siblings : 48 2025-05-07T19:43:04.6477454Z core id : 18 2025-05-07T19:43:04.6477533Z cpu cores : 24 2025-05-07T19:43:04.6477606Z apicid : 101 2025-05-07T19:43:04.6477703Z initial apicid : 101 2025-05-07T19:43:04.6477780Z fpu : yes 2025-05-07T19:43:04.6477861Z fpu_exception : yes 2025-05-07T19:43:04.6477937Z cpuid level : 13 2025-05-07T19:43:04.6478020Z wp : yes 2025-05-07T19:43:04.6480267Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6480686Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6480768Z bogomips : 6000.01 2025-05-07T19:43:04.6480855Z clflush size : 64 2025-05-07T19:43:04.6480946Z cache_alignment : 64 2025-05-07T19:43:04.6481085Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6481168Z power management: 2025-05-07T19:43:04.6481172Z 2025-05-07T19:43:04.6481252Z processor : 91 2025-05-07T19:43:04.6481353Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6481427Z cpu family : 6 2025-05-07T19:43:04.6481506Z model : 85 2025-05-07T19:43:04.6481676Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6481862Z stepping : 7 2025-05-07T19:43:04.6481937Z microcode : 0x5003901 2025-05-07T19:43:04.6482006Z cpu MHz : 3074.368 2025-05-07T19:43:04.6482101Z cache size : 36608 KB 2025-05-07T19:43:04.6482222Z physical id : 1 2025-05-07T19:43:04.6482298Z siblings : 48 2025-05-07T19:43:04.6482368Z core id : 19 2025-05-07T19:43:04.6482453Z cpu cores : 24 2025-05-07T19:43:04.6482521Z apicid : 103 2025-05-07T19:43:04.6482600Z initial apicid : 103 2025-05-07T19:43:04.6482680Z fpu : yes 2025-05-07T19:43:04.6482759Z fpu_exception : yes 2025-05-07T19:43:04.6482830Z cpuid level : 13 2025-05-07T19:43:04.6482897Z wp : yes 2025-05-07T19:43:04.6485321Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6485722Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6485821Z bogomips : 6000.01 2025-05-07T19:43:04.6485905Z clflush size : 64 2025-05-07T19:43:04.6485988Z cache_alignment : 64 2025-05-07T19:43:04.6486125Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6486232Z power management: 2025-05-07T19:43:04.6486236Z 2025-05-07T19:43:04.6486313Z processor : 92 2025-05-07T19:43:04.6486403Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6486493Z cpu family : 6 2025-05-07T19:43:04.6486572Z model : 85 2025-05-07T19:43:04.6486733Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6486814Z stepping : 7 2025-05-07T19:43:04.6486910Z microcode : 0x5003901 2025-05-07T19:43:04.6486991Z cpu MHz : 3000.008 2025-05-07T19:43:04.6487069Z cache size : 36608 KB 2025-05-07T19:43:04.6487160Z physical id : 1 2025-05-07T19:43:04.6487242Z siblings : 48 2025-05-07T19:43:04.6487320Z core id : 20 2025-05-07T19:43:04.6487398Z cpu cores : 24 2025-05-07T19:43:04.6487575Z apicid : 105 2025-05-07T19:43:04.6487657Z initial apicid : 105 2025-05-07T19:43:04.6487732Z fpu : yes 2025-05-07T19:43:04.6487832Z fpu_exception : yes 2025-05-07T19:43:04.6487914Z cpuid level : 13 2025-05-07T19:43:04.6487999Z wp : yes 2025-05-07T19:43:04.6490269Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6490670Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6490759Z bogomips : 6000.01 2025-05-07T19:43:04.6490852Z clflush size : 64 2025-05-07T19:43:04.6490938Z cache_alignment : 64 2025-05-07T19:43:04.6491073Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6491155Z power management: 2025-05-07T19:43:04.6491159Z 2025-05-07T19:43:04.6491251Z processor : 93 2025-05-07T19:43:04.6491336Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6491417Z cpu family : 6 2025-05-07T19:43:04.6491502Z model : 85 2025-05-07T19:43:04.6491660Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6491737Z stepping : 7 2025-05-07T19:43:04.6491819Z microcode : 0x5003901 2025-05-07T19:43:04.6491908Z cpu MHz : 3488.586 2025-05-07T19:43:04.6491991Z cache size : 36608 KB 2025-05-07T19:43:04.6492074Z physical id : 1 2025-05-07T19:43:04.6492167Z siblings : 48 2025-05-07T19:43:04.6492243Z core id : 21 2025-05-07T19:43:04.6492385Z cpu cores : 24 2025-05-07T19:43:04.6492465Z apicid : 107 2025-05-07T19:43:04.6492558Z initial apicid : 107 2025-05-07T19:43:04.6492637Z fpu : yes 2025-05-07T19:43:04.6492724Z fpu_exception : yes 2025-05-07T19:43:04.6492810Z cpuid level : 13 2025-05-07T19:43:04.6492883Z wp : yes 2025-05-07T19:43:04.6495151Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6495564Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6495652Z bogomips : 6000.01 2025-05-07T19:43:04.6495732Z clflush size : 64 2025-05-07T19:43:04.6495837Z cache_alignment : 64 2025-05-07T19:43:04.6495964Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6496054Z power management: 2025-05-07T19:43:04.6496059Z 2025-05-07T19:43:04.6496142Z processor : 94 2025-05-07T19:43:04.6496243Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6496320Z cpu family : 6 2025-05-07T19:43:04.6496393Z model : 85 2025-05-07T19:43:04.6496569Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6496650Z stepping : 7 2025-05-07T19:43:04.6496731Z microcode : 0x5003901 2025-05-07T19:43:04.6496812Z cpu MHz : 3211.056 2025-05-07T19:43:04.6496909Z cache size : 36608 KB 2025-05-07T19:43:04.6496988Z physical id : 1 2025-05-07T19:43:04.6497066Z siblings : 48 2025-05-07T19:43:04.6497157Z core id : 22 2025-05-07T19:43:04.6497344Z cpu cores : 24 2025-05-07T19:43:04.6497421Z apicid : 109 2025-05-07T19:43:04.6497504Z initial apicid : 109 2025-05-07T19:43:04.6497639Z fpu : yes 2025-05-07T19:43:04.6497720Z fpu_exception : yes 2025-05-07T19:43:04.6497797Z cpuid level : 13 2025-05-07T19:43:04.6497882Z wp : yes 2025-05-07T19:43:04.6500074Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6500462Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6500561Z bogomips : 6000.01 2025-05-07T19:43:04.6500648Z clflush size : 64 2025-05-07T19:43:04.6500732Z cache_alignment : 64 2025-05-07T19:43:04.6500868Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6500949Z power management: 2025-05-07T19:43:04.6500953Z 2025-05-07T19:43:04.6501028Z processor : 95 2025-05-07T19:43:04.6501113Z vendor_id : GenuineIntel 2025-05-07T19:43:04.6501195Z cpu family : 6 2025-05-07T19:43:04.6501273Z model : 85 2025-05-07T19:43:04.6501427Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:04.6501518Z stepping : 7 2025-05-07T19:43:04.6501599Z microcode : 0x5003901 2025-05-07T19:43:04.6501676Z cpu MHz : 3216.365 2025-05-07T19:43:04.6501761Z cache size : 36608 KB 2025-05-07T19:43:04.6501848Z physical id : 1 2025-05-07T19:43:04.6501929Z siblings : 48 2025-05-07T19:43:04.6502010Z core id : 23 2025-05-07T19:43:04.6502102Z cpu cores : 24 2025-05-07T19:43:04.6502176Z apicid : 111 2025-05-07T19:43:04.6502319Z initial apicid : 111 2025-05-07T19:43:04.6502392Z fpu : yes 2025-05-07T19:43:04.6502484Z fpu_exception : yes 2025-05-07T19:43:04.6502565Z cpuid level : 13 2025-05-07T19:43:04.6502639Z wp : yes 2025-05-07T19:43:04.6504891Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:04.6505254Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:04.6505338Z bogomips : 6000.01 2025-05-07T19:43:04.6505417Z clflush size : 64 2025-05-07T19:43:04.6505498Z cache_alignment : 64 2025-05-07T19:43:04.6505615Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:04.6505701Z power management: 2025-05-07T19:43:04.6505705Z 2025-05-07T19:43:04.6505708Z 2025-05-07T19:43:04.6505818Z ################################################################################ 2025-05-07T19:43:04.6505909Z [INFO] Print PCI info ... 2025-05-07T19:43:04.6505983Z + lspci -v 2025-05-07T19:43:04.6505994Z 2025-05-07T19:43:04.6506162Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:04.6506263Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:04.6506379Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:04.6506383Z 2025-05-07T19:43:04.6506578Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:04.6506655Z Physical Slot: 1 2025-05-07T19:43:04.6506766Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:04.6506770Z 2025-05-07T19:43:04.6507020Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:04.6507158Z Physical Slot: 1 2025-05-07T19:43:04.6507278Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:04.6507282Z 2025-05-07T19:43:04.6507551Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:04.6507625Z Physical Slot: 3 2025-05-07T19:43:04.6507729Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:04.6507858Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:04.6507997Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:04.6508001Z 2025-05-07T19:43:04.6508300Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:04.6508404Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:04.6508504Z Physical Slot: 4 2025-05-07T19:43:04.6508633Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:04.6508776Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:04.6508882Z Capabilities: 2025-05-07T19:43:04.6508965Z Kernel driver in use: nvme 2025-05-07T19:43:04.6508969Z 2025-05-07T19:43:04.6509256Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:04.6509339Z Physical Slot: 5 2025-05-07T19:43:04.6509441Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:04.6509757Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:04.6509889Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:04.6510048Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:04.6510147Z Capabilities: 2025-05-07T19:43:04.6510240Z Kernel driver in use: ena 2025-05-07T19:43:04.6510246Z 2025-05-07T19:43:04.6510249Z 2025-05-07T19:43:04.6510427Z ################################################################################ 2025-05-07T19:43:04.6510538Z [INFO] Print Linux distribution info ... 2025-05-07T19:43:04.6510625Z + uname -a 2025-05-07T19:43:04.6510630Z 2025-05-07T19:43:04.6511042Z Linux a4cdfef5f677 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:04.6511047Z 2025-05-07T19:43:04.6511125Z + uname -m 2025-05-07T19:43:04.6511129Z 2025-05-07T19:43:04.6511203Z x86_64 2025-05-07T19:43:04.6511207Z 2025-05-07T19:43:04.6511296Z + cat /proc/version 2025-05-07T19:43:04.6511300Z 2025-05-07T19:43:04.6511915Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:04.6511920Z 2025-05-07T19:43:04.6512003Z + cat /etc/os-release 2025-05-07T19:43:04.6512008Z 2025-05-07T19:43:04.6512097Z NAME="Amazon Linux" 2025-05-07T19:43:04.6512180Z VERSION="2023" 2025-05-07T19:43:04.6512259Z ID="amzn" 2025-05-07T19:43:04.6512346Z ID_LIKE="fedora" 2025-05-07T19:43:04.6512433Z VERSION_ID="2023" 2025-05-07T19:43:04.6512532Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:04.6512638Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:04.6512723Z ANSI_COLOR="0;33" 2025-05-07T19:43:04.6512847Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:04.6513039Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:04.6513215Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:04.6513371Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:04.6513568Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:04.6513652Z VENDOR_NAME="AWS" 2025-05-07T19:43:04.6513772Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:04.6513860Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:04.6513865Z 2025-05-07T19:43:04.6551824Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:04.6551998Z . $PRELUDE; print_gpu_info 2025-05-07T19:43:04.6552307Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.6552470Z env: 2025-05-07T19:43:04.6552584Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.6552693Z BUILD_ENV: build_binary 2025-05-07T19:43:04.6552783Z BUILD_TARGET: default 2025-05-07T19:43:04.6552867Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.6552973Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.6553050Z ##[endgroup] 2025-05-07T19:43:05.1018802Z ################################################################################ 2025-05-07T19:43:05.1020602Z [INFO] Printing general display info ... 2025-05-07T19:43:05.1044054Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:05.2024025Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:05.2036229Z /usr/bin/sudo 2025-05-07T19:43:05.2047175Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.2054403Z /usr/bin/yum 2025-05-07T19:43:05.2054681Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:05.2076634Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:05.4276365Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:43:05.5232896Z Dependencies resolved. 2025-05-07T19:43:05.5452517Z Nothing to do. 2025-05-07T19:43:05.5452841Z Complete! 2025-05-07T19:43:05.5841354Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:05.5865909Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:05.8048737Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:43:05.8571398Z Dependencies resolved. 2025-05-07T19:43:05.8736784Z ================================================================================ 2025-05-07T19:43:05.8737291Z Package Arch Version Repository Size 2025-05-07T19:43:05.8737731Z ================================================================================ 2025-05-07T19:43:05.8738057Z Installing: 2025-05-07T19:43:05.8738419Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:05.8738907Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:05.8739216Z 2025-05-07T19:43:05.8739314Z Transaction Summary 2025-05-07T19:43:05.8739590Z ================================================================================ 2025-05-07T19:43:05.8739910Z Install 2 Packages 2025-05-07T19:43:05.8740061Z 2025-05-07T19:43:05.8740175Z Total download size: 347 k 2025-05-07T19:43:05.8740433Z Installed size: 883 k 2025-05-07T19:43:05.8740689Z Downloading Packages: 2025-05-07T19:43:06.0232938Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.5 MB/s | 28 kB 00:00 2025-05-07T19:43:06.0330018Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 11 MB/s | 319 kB 00:00 2025-05-07T19:43:06.0337529Z -------------------------------------------------------------------------------- 2025-05-07T19:43:06.0338952Z Total 2.1 MB/s | 347 kB 00:00 2025-05-07T19:43:06.0552119Z Running transaction check 2025-05-07T19:43:06.0606627Z Transaction check succeeded. 2025-05-07T19:43:06.0607533Z Running transaction test 2025-05-07T19:43:06.0758620Z Transaction test succeeded. 2025-05-07T19:43:06.0760393Z Running transaction 2025-05-07T19:43:06.1032361Z Preparing : 1/1 2025-05-07T19:43:06.1105151Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:06.1147009Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.1579449Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.1581910Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:07.1951047Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.1951447Z 2025-05-07T19:43:07.1951854Z Installed: 2025-05-07T19:43:07.1952228Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:07.1952614Z 2025-05-07T19:43:07.1952716Z Complete! 2025-05-07T19:43:07.2305544Z + hostname 2025-05-07T19:43:07.2305976Z 2025-05-07T19:43:07.2312575Z a4cdfef5f677 2025-05-07T19:43:07.2313352Z 2025-05-07T19:43:07.2313878Z + sudo lshw -C display 2025-05-07T19:43:07.2314095Z 2025-05-07T19:43:07.4272838Z *-display UNCLAIMED 2025-05-07T19:43:07.4273785Z description: VGA compatible controller 2025-05-07T19:43:07.4274784Z product: Amazon.com, Inc. 2025-05-07T19:43:07.4275653Z vendor: Amazon.com, Inc. 2025-05-07T19:43:07.4276416Z physical id: 3 2025-05-07T19:43:07.4277148Z bus info: pci@0000:00:03.0 2025-05-07T19:43:07.4277918Z version: 00 2025-05-07T19:43:07.4278597Z width: 32 bits 2025-05-07T19:43:07.4279269Z clock: 33MHz 2025-05-07T19:43:07.4279657Z capabilities: vga_controller bus_master 2025-05-07T19:43:07.4280020Z configuration: latency=0 2025-05-07T19:43:07.4280351Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:07.4293756Z 2025-05-07T19:43:07.4294872Z ################################################################################ 2025-05-07T19:43:07.4295976Z [INFO] Printing NVIDIA GPU info ... 2025-05-07T19:43:07.4405304Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:07.4429690Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.4430804Z [CHECK] nvidia-smi not found 2025-05-07T19:43:07.4431131Z ################################################################################ 2025-05-07T19:43:07.4431505Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:07.4558861Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:07.4588307Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.4589938Z [CHECK] rocminfo not found 2025-05-07T19:43:07.4599178Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.4604482Z [CHECK] rocm-smi not found 2025-05-07T19:43:07.4676611Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:07.4677111Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:07.4677765Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:07.4678111Z env: 2025-05-07T19:43:07.4678338Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:07.4678670Z BUILD_ENV: build_binary 2025-05-07T19:43:07.4678921Z BUILD_TARGET: default 2025-05-07T19:43:07.4679172Z BUILD_VARIANT: cuda 2025-05-07T19:43:07.4679428Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:07.4679680Z ##[endgroup] 2025-05-07T19:43:07.8744895Z ################################################################################ 2025-05-07T19:43:07.8745284Z # Setup Miniconda 2025-05-07T19:43:07.8745553Z # 2025-05-07T19:43:07.8760265Z # [2025-05-07T19:43:07.875Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:07.8761632Z ################################################################################ 2025-05-07T19:43:07.8762607Z 2025-05-07T19:43:07.8776578Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:07.9609335Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:07.9609756Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:07.9609954Z 2025-05-07T19:43:07.9629010Z 2025-05-07T19:43:07.9629630Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:07.9657692Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:08.9861913Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:08.9863052Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:08.9863829Z 2025-05-07T19:43:08.9998129Z PREFIX=/github/home/miniconda 2025-05-07T19:43:09.3579352Z Unpacking payload ... 2025-05-07T19:43:09.8407573Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:10.5134201Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:12.3642971Z 2025-05-07T19:43:12.3643788Z Installing base environment... 2025-05-07T19:43:12.3644163Z 2025-05-07T19:43:13.3531208Z Preparing transaction: ...working... done 2025-05-07T19:43:16.2161712Z Executing transaction: ...working... done 2025-05-07T19:43:16.7640298Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:16.8322592Z installation finished. 2025-05-07T19:43:16.8331170Z 2025-05-07T19:43:16.8331496Z + rm -f miniconda.sh 2025-05-07T19:43:16.8331740Z 2025-05-07T19:43:16.8514193Z 2025-05-07T19:43:16.8515270Z [SETUP] Reloading the bash configuration ... 2025-05-07T19:43:16.8515747Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:16.8515987Z 2025-05-07T19:43:17.2237662Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:17.2238913Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:17.2239992Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:17.2241111Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:17.2242180Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:17.2243393Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:17.2244709Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:17.2246074Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:17.2247457Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:17.2248364Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:17.2249288Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:17.2249676Z modified /github/home/.bashrc 2025-05-07T19:43:17.2249865Z 2025-05-07T19:43:17.2250104Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:17.2250410Z 2025-05-07T19:43:17.2785449Z 2025-05-07T19:43:17.2785976Z + . /github/home/.bashrc 2025-05-07T19:43:17.2786528Z 2025-05-07T19:43:18.0673096Z 2025-05-07T19:43:18.0674336Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 2025-05-07T19:43:18.0708124Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:29.9396324Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:31.4133235Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:31.5033623Z 2025-05-07T19:43:31.5034480Z ## Package Plan ## 2025-05-07T19:43:31.5034748Z 2025-05-07T19:43:31.5034901Z environment location: /github/home/miniconda 2025-05-07T19:43:31.5035196Z 2025-05-07T19:43:31.5035322Z added / updated specs: 2025-05-07T19:43:31.5035662Z - conda-libmamba-solver 2025-05-07T19:43:31.5035987Z - libarchive 2025-05-07T19:43:31.5036228Z - libmamba 2025-05-07T19:43:31.5036494Z - libmambapy 2025-05-07T19:43:31.5036672Z 2025-05-07T19:43:31.5036676Z 2025-05-07T19:43:31.5036863Z The following packages will be downloaded: 2025-05-07T19:43:31.5037140Z 2025-05-07T19:43:31.5037650Z package | build 2025-05-07T19:43:31.5038032Z ---------------------------|----------------- 2025-05-07T19:43:31.5038542Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:31.5039121Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:31.5039614Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:31.5040170Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:31.5040670Z ------------------------------------------------------------ 2025-05-07T19:43:31.5041079Z Total: 1.4 MB 2025-05-07T19:43:31.5041313Z 2025-05-07T19:43:31.5041445Z The following packages will be UPDATED: 2025-05-07T19:43:31.5041712Z 2025-05-07T19:43:31.5048096Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:43:31.5049052Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:31.5049507Z 2025-05-07T19:43:31.5049762Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:31.5050158Z 2025-05-07T19:43:31.5050519Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:31.5051444Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:31.5051992Z 2025-05-07T19:43:31.5051996Z 2025-05-07T19:43:31.5052000Z 2025-05-07T19:43:31.5052165Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:31.5052639Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:31.5052896Z 2025-05-07T19:43:31.5053377Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:31.5053650Z 2025-05-07T19:43:31.5053663Z 2025-05-07T19:43:31.5053908Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:31.5054231Z 2025-05-07T19:43:31.5054234Z 2025-05-07T19:43:31.5054493Z 2025-05-07T19:43:31.5644120Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:31.5644590Z 2025-05-07T19:43:31.5644598Z 2025-05-07T19:43:31.5644606Z 2025-05-07T19:43:31.5711812Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:31.5772858Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:31.5773668Z 2025-05-07T19:43:31.5773683Z 2025-05-07T19:43:31.5773696Z 2025-05-07T19:43:31.5854752Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:31.5855096Z 2025-05-07T19:43:31.5855257Z 2025-05-07T19:43:31.5975640Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:31.5975957Z 2025-05-07T19:43:31.5976081Z 2025-05-07T19:43:31.5990375Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:31.5991632Z 2025-05-07T19:43:31.6116244Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:31.6117075Z 2025-05-07T19:43:31.6118454Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:31.6118721Z 2025-05-07T19:43:31.6862447Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:31.6863736Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:31.6865398Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:31.6866381Z 2025-05-07T19:43:31.6866992Z 2025-05-07T19:43:31.6867481Z  2025-05-07T19:43:31.6867697Z 2025-05-07T19:43:31.6867701Z 2025-05-07T19:43:31.6867878Z  2025-05-07T19:43:31.6868124Z 2025-05-07T19:43:31.6868128Z 2025-05-07T19:43:31.6868131Z 2025-05-07T19:43:31.6869892Z  done 2025-05-07T19:43:31.7874969Z Preparing transaction: | done 2025-05-07T19:43:31.8888110Z Verifying transaction: - done 2025-05-07T19:43:33.1916330Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:34.8317210Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:34.8349735Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:35.5561036Z Channels: 2025-05-07T19:43:36.6249898Z - defaults 2025-05-07T19:43:36.6250626Z Platform: linux-64 2025-05-07T19:43:36.6252331Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:36.7543225Z Solving environment: / - Channels: 2025-05-07T19:43:36.7544202Z - defaults 2025-05-07T19:43:36.7544825Z Platform: linux-64 2025-05-07T19:43:37.0912938Z Collecting package metadata (repodata.json): | / - \ | / done 2025-05-07T19:43:37.3192785Z Solving environment: \ | / - done 2025-05-07T19:43:37.4095505Z done 2025-05-07T19:43:37.4730722Z 2025-05-07T19:43:37.4731319Z ## Package Plan ## 2025-05-07T19:43:37.4731835Z 2025-05-07T19:43:37.4732464Z environment location: /github/home/miniconda 2025-05-07T19:43:37.4733540Z 2025-05-07T19:43:37.4733879Z added / updated specs: 2025-05-07T19:43:37.4734636Z - conda 2025-05-07T19:43:37.4734982Z 2025-05-07T19:43:37.4734994Z 2025-05-07T19:43:37.4735383Z The following packages will be downloaded: 2025-05-07T19:43:37.4736057Z 2025-05-07T19:43:37.4736393Z package | build 2025-05-07T19:43:37.4737326Z ---------------------------|----------------- 2025-05-07T19:43:37.4737816Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:37.4738254Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:37.4738683Z ------------------------------------------------------------ 2025-05-07T19:43:37.4739045Z Total: 1.4 MB 2025-05-07T19:43:37.4739424Z 2025-05-07T19:43:37.4739575Z The following packages will be UPDATED: 2025-05-07T19:43:37.4739788Z 2025-05-07T19:43:37.4740395Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:37.4740962Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:37.4741223Z 2025-05-07T19:43:37.4741227Z 2025-05-07T19:43:37.4741231Z 2025-05-07T19:43:37.4741384Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:37.4741792Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:37.4742027Z 2025-05-07T19:43:37.5096893Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:37.5097198Z 2025-05-07T19:43:37.5526806Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:37.6811528Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:37.6812314Z 2025-05-07T19:43:37.6813122Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:37.6813908Z 2025-05-07T19:43:37.7136281Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:37.7136761Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:37.7137451Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:37.7137841Z 2025-05-07T19:43:37.7138059Z 2025-05-07T19:43:37.7138762Z  done 2025-05-07T19:43:37.8149124Z Preparing transaction: | done 2025-05-07T19:43:37.9156321Z Verifying transaction: - done 2025-05-07T19:43:39.9191990Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:40.4676659Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:40.4677875Z + conda clean --packages --tarball -y 2025-05-07T19:43:40.4678091Z 2025-05-07T19:43:40.9055246Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:40.9056266Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:40.9602303Z 2025-05-07T19:43:40.9606644Z + conda clean --all -y 2025-05-07T19:43:40.9607167Z 2025-05-07T19:43:41.4052425Z There are no unused tarball(s) to remove. 2025-05-07T19:43:41.4053447Z Will remove 1 index cache(s). 2025-05-07T19:43:41.4054290Z There are no unused package(s) to remove. 2025-05-07T19:43:41.4055240Z There are no tempfile(s) to remove. 2025-05-07T19:43:41.4056106Z There are no logfile(s) to remove. 2025-05-07T19:43:41.4587786Z 2025-05-07T19:43:41.4590008Z + conda info 2025-05-07T19:43:41.4590180Z 2025-05-07T19:43:42.0255552Z 2025-05-07T19:43:42.0256319Z active environment : base 2025-05-07T19:43:42.0256708Z active env location : /github/home/miniconda 2025-05-07T19:43:42.0257063Z shell level : 1 2025-05-07T19:43:42.0257350Z user config file : /github/home/.condarc 2025-05-07T19:43:42.0257761Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:42.0258258Z conda version : 25.3.1 2025-05-07T19:43:42.0258556Z conda-build version : not installed 2025-05-07T19:43:42.0258866Z python version : 3.13.2.final.0 2025-05-07T19:43:42.0259168Z solver : libmamba (default) 2025-05-07T19:43:42.0259504Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:42.0259814Z __conda=25.3.1=0 2025-05-07T19:43:42.0260107Z __glibc=2.34=0 2025-05-07T19:43:42.0260383Z __linux=6.1.130=0 2025-05-07T19:43:42.0260671Z __unix=0=0 2025-05-07T19:43:42.0260999Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:42.0261413Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:42.0261765Z conda av metadata url : None 2025-05-07T19:43:42.0262127Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:42.0262570Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:42.0262953Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:42.0263605Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:42.0263980Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:42.0264337Z /github/home/.conda/pkgs 2025-05-07T19:43:42.0264689Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:42.0265017Z /github/home/.conda/envs 2025-05-07T19:43:42.0265344Z platform : linux-64 2025-05-07T19:43:42.0266206Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:42.0267102Z UID:GID : 0:0 2025-05-07T19:43:42.0267355Z netrc file : None 2025-05-07T19:43:42.0267629Z offline mode : False 2025-05-07T19:43:42.0267802Z 2025-05-07T19:43:42.0858696Z 2025-05-07T19:43:42.0859070Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:42.0859907Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_5b241200-45c7-4009-9747-071d37a14e8e ... 2025-05-07T19:43:42.0860790Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:42.1004989Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.11 2025-05-07T19:43:42.1005535Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.11 2025-05-07T19:43:42.1006324Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:42.1006665Z env: 2025-05-07T19:43:42.1006886Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:42.1007205Z BUILD_ENV: build_binary 2025-05-07T19:43:42.1007462Z BUILD_TARGET: default 2025-05-07T19:43:42.1007688Z BUILD_VARIANT: cuda 2025-05-07T19:43:42.1007933Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:42.1008178Z ##[endgroup] 2025-05-07T19:43:42.5129551Z ################################################################################ 2025-05-07T19:43:42.5130609Z # Create Conda Environment 2025-05-07T19:43:42.5131419Z # 2025-05-07T19:43:42.5141973Z # [2025-05-07T19:43:42.513Z] + create_conda_environment build_binary 3.11 2025-05-07T19:43:42.5143364Z ################################################################################ 2025-05-07T19:43:42.5144039Z 2025-05-07T19:43:42.5156254Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:42.6008576Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:42.6009468Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:42.6009872Z + conda info --envs 2025-05-07T19:43:42.6010016Z 2025-05-07T19:43:43.1792579Z 2025-05-07T19:43:43.1793215Z # conda environments: 2025-05-07T19:43:43.1793972Z # 2025-05-07T19:43:43.1794639Z base /github/home/miniconda 2025-05-07T19:43:43.1795312Z 2025-05-07T19:43:43.2379480Z 2025-05-07T19:43:43.2379872Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:44.8468579Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:44.8468938Z 2025-05-07T19:43:44.8488238Z 2025-05-07T19:43:44.8496647Z [SETUP] Creating new Conda environment (Python 3.11) ... 2025-05-07T19:43:44.8519169Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.11 2025-05-07T19:43:45.4170541Z Channels: 2025-05-07T19:43:45.4170966Z - defaults 2025-05-07T19:43:45.4171209Z Platform: linux-64 2025-05-07T19:43:46.8069819Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:46.9076288Z Solving environment: | done 2025-05-07T19:43:46.9371591Z 2025-05-07T19:43:46.9372191Z ## Package Plan ## 2025-05-07T19:43:46.9372682Z 2025-05-07T19:43:46.9373290Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:46.9374315Z 2025-05-07T19:43:46.9374625Z added / updated specs: 2025-05-07T19:43:46.9375383Z - python=3.11 2025-05-07T19:43:46.9375877Z 2025-05-07T19:43:46.9375882Z 2025-05-07T19:43:46.9376018Z The following packages will be downloaded: 2025-05-07T19:43:46.9376271Z 2025-05-07T19:43:46.9376394Z package | build 2025-05-07T19:43:46.9376759Z ---------------------------|----------------- 2025-05-07T19:43:46.9377165Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:46.9377618Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:46.9378070Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:46.9378529Z python-3.11.11 | he870216_0 32.9 MB 2025-05-07T19:43:46.9378957Z setuptools-78.1.1 | py311h06a4308_0 2.3 MB 2025-05-07T19:43:46.9379513Z wheel-0.45.1 | py311h06a4308_0 151 KB 2025-05-07T19:43:46.9379921Z ------------------------------------------------------------ 2025-05-07T19:43:46.9380275Z Total: 35.4 MB 2025-05-07T19:43:46.9380505Z 2025-05-07T19:43:46.9380774Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:46.9380999Z 2025-05-07T19:43:46.9381205Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:46.9381676Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:46.9382413Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:46.9382909Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:46.9383470Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:46.9383935Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:46.9384981Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:46.9385452Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:46.9385968Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:46.9386648Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:46.9387103Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:46.9387563Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:46.9387996Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:46.9388440Z python pkgs/main/linux-64::python-3.11.11-he870216_0 2025-05-07T19:43:46.9388911Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:46.9389512Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py311h06a4308_0 2025-05-07T19:43:46.9390032Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:46.9390441Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:46.9390860Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:46.9391320Z wheel pkgs/main/linux-64::wheel-0.45.1-py311h06a4308_0 2025-05-07T19:43:46.9391738Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:46.9392148Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:46.9392411Z 2025-05-07T19:43:46.9392415Z 2025-05-07T19:43:46.9392419Z 2025-05-07T19:43:46.9392576Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:46.9392992Z python-3.11.11 | 32.9 MB | | 0% 2025-05-07T19:43:46.9393238Z 2025-05-07T19:43:46.9393603Z setuptools-78.1.1 | 2.3 MB | | 0%  2025-05-07T19:43:46.9393866Z 2025-05-07T19:43:46.9403324Z 2025-05-07T19:43:46.9404727Z wheel-0.45.1 | 151 KB | | 0%  2025-05-07T19:43:46.9405045Z 2025-05-07T19:43:46.9405050Z 2025-05-07T19:43:46.9405054Z 2025-05-07T19:43:46.9429734Z ca-certificates-2025 | 129 KB | | 0%  2025-05-07T19:43:46.9430070Z 2025-05-07T19:43:46.9430094Z 2025-05-07T19:43:46.9430119Z 2025-05-07T19:43:46.9430123Z 2025-05-07T19:43:46.9440667Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:46.9441590Z 2025-05-07T19:43:46.9441604Z 2025-05-07T19:43:46.9441616Z 2025-05-07T19:43:46.9441626Z 2025-05-07T19:43:46.9441637Z 2025-05-07T19:43:46.9839493Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:46.9839814Z 2025-05-07T19:43:46.9839829Z 2025-05-07T19:43:46.9839833Z 2025-05-07T19:43:46.9995853Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:46.9996186Z 2025-05-07T19:43:46.9996191Z 2025-05-07T19:43:47.0047869Z wheel-0.45.1 | 151 KB | ########## | 100%  2025-05-07T19:43:47.0048190Z 2025-05-07T19:43:47.0048195Z 2025-05-07T19:43:47.0048199Z 2025-05-07T19:43:47.0059339Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:47.0059638Z 2025-05-07T19:43:47.0059642Z 2025-05-07T19:43:47.0059645Z 2025-05-07T19:43:47.0059673Z 2025-05-07T19:43:47.0059694Z 2025-05-07T19:43:47.0165946Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:47.0166285Z 2025-05-07T19:43:47.0166290Z 2025-05-07T19:43:47.0166294Z 2025-05-07T19:43:47.0166298Z 2025-05-07T19:43:47.0166302Z 2025-05-07T19:43:47.0296003Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:47.0296357Z 2025-05-07T19:43:47.0343750Z setuptools-78.1.1 | 2.3 MB | ########## | 100%  2025-05-07T19:43:47.0344075Z 2025-05-07T19:43:47.0344079Z 2025-05-07T19:43:47.0344083Z 2025-05-07T19:43:47.0344087Z 2025-05-07T19:43:47.0373794Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.0609626Z python-3.11.11 | 32.9 MB | 7 | 8% 2025-05-07T19:43:47.0610176Z 2025-05-07T19:43:47.0610197Z 2025-05-07T19:43:47.0610644Z wheel-0.45.1 | 151 KB | ########## | 100%  2025-05-07T19:43:47.0610906Z 2025-05-07T19:43:47.0610941Z 2025-05-07T19:43:47.0830896Z wheel-0.45.1 | 151 KB | ########## | 100%  2025-05-07T19:43:47.0831203Z 2025-05-07T19:43:47.0831207Z 2025-05-07T19:43:47.0831211Z 2025-05-07T19:43:47.0831215Z 2025-05-07T19:43:47.0831463Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.0831750Z 2025-05-07T19:43:47.0831754Z 2025-05-07T19:43:47.0831758Z 2025-05-07T19:43:47.0831791Z 2025-05-07T19:43:47.1373946Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.2373852Z python-3.11.11 | 32.9 MB | ####6 | 47% 2025-05-07T19:43:47.3332290Z python-3.11.11 | 32.9 MB | #########5 | 96% 2025-05-07T19:43:47.3332788Z 2025-05-07T19:43:47.3333511Z setuptools-78.1.1 | 2.3 MB | ########## | 100%  2025-05-07T19:43:47.3333820Z 2025-05-07T19:43:47.3687145Z setuptools-78.1.1 | 2.3 MB | ########## | 100%  2025-05-07T19:43:47.9320767Z python-3.11.11 | 32.9 MB | ########## | 100% 2025-05-07T19:43:47.9324465Z python-3.11.11 | 32.9 MB | ########## | 100% 2025-05-07T19:43:47.9325009Z 2025-05-07T19:43:47.9325255Z 2025-05-07T19:43:47.9325653Z  2025-05-07T19:43:47.9325915Z 2025-05-07T19:43:47.9325922Z 2025-05-07T19:43:47.9326167Z  2025-05-07T19:43:47.9326404Z 2025-05-07T19:43:47.9326441Z 2025-05-07T19:43:47.9326445Z 2025-05-07T19:43:47.9326628Z  2025-05-07T19:43:47.9326859Z 2025-05-07T19:43:47.9326863Z 2025-05-07T19:43:47.9326867Z 2025-05-07T19:43:47.9326870Z 2025-05-07T19:43:47.9327081Z  2025-05-07T19:43:47.9327314Z 2025-05-07T19:43:47.9327318Z 2025-05-07T19:43:47.9327321Z 2025-05-07T19:43:47.9327325Z 2025-05-07T19:43:47.9327328Z 2025-05-07T19:43:47.9327527Z  done 2025-05-07T19:43:48.1451241Z Preparing transaction: - \ done 2025-05-07T19:43:49.5346695Z Verifying transaction: / - \ | / - \ | / - \ | / done 2025-05-07T19:43:51.6516464Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:51.6555142Z # 2025-05-07T19:43:51.6555938Z # To activate this environment, use 2025-05-07T19:43:51.6556778Z # 2025-05-07T19:43:51.6557367Z # $ conda activate build_binary 2025-05-07T19:43:51.6558141Z # 2025-05-07T19:43:51.6558776Z # To deactivate an active environment, use 2025-05-07T19:43:51.6559622Z # 2025-05-07T19:43:51.6560174Z # $ conda deactivate 2025-05-07T19:43:51.6560633Z 2025-05-07T19:43:51.7397489Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:51.7429387Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:54.6238458Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:54.6240382Z 2025-05-07T19:43:54.6241348Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (25.1) 2025-05-07T19:43:54.6242016Z Collecting pip 2025-05-07T19:43:54.6242373Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:54.6242813Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:54.6243774Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 72.0 MB/s eta 0:00:00 2025-05-07T19:43:54.6244154Z Installing collected packages: pip 2025-05-07T19:43:54.6244500Z Attempting uninstall: pip 2025-05-07T19:43:54.6244807Z Found existing installation: pip 25.1 2025-05-07T19:43:54.6245168Z Uninstalling pip-25.1: 2025-05-07T19:43:54.6245651Z Successfully uninstalled pip-25.1 2025-05-07T19:43:54.6246006Z Successfully installed pip-25.1.1 2025-05-07T19:43:54.6246211Z 2025-05-07T19:43:54.6832211Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:54.6862186Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:55.3411553Z Channels: 2025-05-07T19:43:55.3412281Z - conda-forge 2025-05-07T19:43:55.3412938Z Platform: linux-64 2025-05-07T19:44:05.0738742Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:06.9792858Z Solving environment: | / - \ | done 2025-05-07T19:44:07.0232980Z 2025-05-07T19:44:07.0233861Z ## Package Plan ## 2025-05-07T19:44:07.0234108Z 2025-05-07T19:44:07.0234362Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:07.0234697Z 2025-05-07T19:44:07.0234810Z added / updated specs: 2025-05-07T19:44:07.0235117Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:07.0235357Z 2025-05-07T19:44:07.0235361Z 2025-05-07T19:44:07.0235503Z The following packages will be downloaded: 2025-05-07T19:44:07.0235731Z 2025-05-07T19:44:07.0235848Z package | build 2025-05-07T19:44:07.0236207Z ---------------------------|----------------- 2025-05-07T19:44:07.0236606Z cffi-1.17.1 | py311hf29c0ef_0 295 KB conda-forge 2025-05-07T19:44:07.0237097Z cryptography-44.0.3 | py311hafd3f86_0 1.5 MB conda-forge 2025-05-07T19:44:07.0237567Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:07.0238015Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:07.0238465Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:07.0238893Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:07.0239347Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:07.0239816Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:07.0240306Z python_abi-3.11 | 2_cp311 5 KB conda-forge 2025-05-07T19:44:07.0240809Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:07.0241321Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:07.0241782Z ------------------------------------------------------------ 2025-05-07T19:44:07.0242138Z Total: 6.4 MB 2025-05-07T19:44:07.0242373Z 2025-05-07T19:44:07.0242506Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:07.0242743Z 2025-05-07T19:44:07.0242957Z cffi conda-forge/linux-64::cffi-1.17.1-py311hf29c0ef_0 2025-05-07T19:44:07.0243484Z cryptography conda-forge/linux-64::cryptography-44.0.3-py311hafd3f86_0 2025-05-07T19:44:07.0244030Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:07.0244499Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:07.0245012Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:07.0245978Z python_abi conda-forge/linux-64::python_abi-3.11-2_cp311 2025-05-07T19:44:07.0246635Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:07.0247270Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:07.0247639Z 2025-05-07T19:44:07.0247758Z The following packages will be UPDATED: 2025-05-07T19:44:07.0247995Z 2025-05-07T19:44:07.0248419Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:07.0249266Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:07.0250110Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:07.0250800Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:07.0251197Z 2025-05-07T19:44:07.0251201Z 2025-05-07T19:44:07.0251209Z 2025-05-07T19:44:07.0251393Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:07.0251779Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:07.0252019Z 2025-05-07T19:44:07.0252451Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:07.0252716Z 2025-05-07T19:44:07.0252720Z 2025-05-07T19:44:07.0264515Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:07.0264819Z 2025-05-07T19:44:07.0264824Z 2025-05-07T19:44:07.0264828Z 2025-05-07T19:44:07.0286737Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:07.0287038Z 2025-05-07T19:44:07.0287074Z 2025-05-07T19:44:07.0287078Z 2025-05-07T19:44:07.0287227Z 2025-05-07T19:44:07.0309412Z cffi-1.17.1 | 295 KB | | 0%  2025-05-07T19:44:07.0309736Z 2025-05-07T19:44:07.0309836Z 2025-05-07T19:44:07.0309844Z 2025-05-07T19:44:07.0309849Z 2025-05-07T19:44:07.0309853Z 2025-05-07T19:44:07.0337350Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:07.0337675Z 2025-05-07T19:44:07.0337811Z 2025-05-07T19:44:07.0337815Z 2025-05-07T19:44:07.0337819Z 2025-05-07T19:44:07.0337823Z 2025-05-07T19:44:07.0337902Z 2025-05-07T19:44:07.0338441Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:07.0338766Z 2025-05-07T19:44:07.0338770Z 2025-05-07T19:44:07.0338774Z 2025-05-07T19:44:07.0338777Z 2025-05-07T19:44:07.0338789Z 2025-05-07T19:44:07.0338792Z 2025-05-07T19:44:07.0338813Z 2025-05-07T19:44:07.0339256Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:07.0339593Z 2025-05-07T19:44:07.0339597Z 2025-05-07T19:44:07.0339600Z 2025-05-07T19:44:07.0339604Z 2025-05-07T19:44:07.0339608Z 2025-05-07T19:44:07.0339616Z 2025-05-07T19:44:07.0339619Z 2025-05-07T19:44:07.0339640Z 2025-05-07T19:44:07.0340453Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:07.0340761Z 2025-05-07T19:44:07.0340775Z 2025-05-07T19:44:07.0340779Z 2025-05-07T19:44:07.0340782Z 2025-05-07T19:44:07.0340786Z 2025-05-07T19:44:07.0340789Z 2025-05-07T19:44:07.0340793Z 2025-05-07T19:44:07.0340811Z 2025-05-07T19:44:07.0340817Z 2025-05-07T19:44:07.0341755Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:07.0342045Z 2025-05-07T19:44:07.0342056Z 2025-05-07T19:44:07.0342060Z 2025-05-07T19:44:07.0342063Z 2025-05-07T19:44:07.0342067Z 2025-05-07T19:44:07.0342085Z 2025-05-07T19:44:07.0342089Z 2025-05-07T19:44:07.0342092Z 2025-05-07T19:44:07.0342096Z 2025-05-07T19:44:07.0342099Z 2025-05-07T19:44:07.0999422Z python_abi-3.11 | 5 KB | | 0%  2025-05-07T19:44:07.0999748Z 2025-05-07T19:44:07.0999753Z 2025-05-07T19:44:07.0999761Z 2025-05-07T19:44:07.1253173Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.1253472Z 2025-05-07T19:44:07.1253680Z 2025-05-07T19:44:07.1276773Z libgcc-15.1.0 | 810 KB | #########4 | 95%  2025-05-07T19:44:07.1277075Z 2025-05-07T19:44:07.1277079Z 2025-05-07T19:44:07.1277084Z 2025-05-07T19:44:07.1277087Z 2025-05-07T19:44:07.1319602Z cffi-1.17.1 | 295 KB | ########## | 100%  2025-05-07T19:44:07.1319903Z 2025-05-07T19:44:07.1320139Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.1320422Z 2025-05-07T19:44:07.1388492Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.1388789Z 2025-05-07T19:44:07.1388793Z 2025-05-07T19:44:07.1388798Z 2025-05-07T19:44:07.1388801Z 2025-05-07T19:44:07.1389002Z 2025-05-07T19:44:07.1429785Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:07.1430149Z 2025-05-07T19:44:07.1430155Z 2025-05-07T19:44:07.1430159Z 2025-05-07T19:44:07.1430164Z 2025-05-07T19:44:07.1430169Z 2025-05-07T19:44:07.1430540Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.1430845Z 2025-05-07T19:44:07.1430849Z 2025-05-07T19:44:07.1591914Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.1592360Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.1671148Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.1671447Z 2025-05-07T19:44:07.1671453Z 2025-05-07T19:44:07.1671457Z 2025-05-07T19:44:07.1671753Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.1672040Z 2025-05-07T19:44:07.1672045Z 2025-05-07T19:44:07.1672052Z 2025-05-07T19:44:07.1779132Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.1779611Z 2025-05-07T19:44:07.1779690Z 2025-05-07T19:44:07.1779697Z 2025-05-07T19:44:07.1779749Z 2025-05-07T19:44:07.1779753Z 2025-05-07T19:44:07.1779758Z 2025-05-07T19:44:07.1779762Z 2025-05-07T19:44:07.1816567Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:07.1816939Z 2025-05-07T19:44:07.1816955Z 2025-05-07T19:44:07.1816959Z 2025-05-07T19:44:07.1816962Z 2025-05-07T19:44:07.1816966Z 2025-05-07T19:44:07.1816969Z 2025-05-07T19:44:07.1816972Z 2025-05-07T19:44:07.1911479Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.1911840Z 2025-05-07T19:44:07.1911844Z 2025-05-07T19:44:07.1911848Z 2025-05-07T19:44:07.1911852Z 2025-05-07T19:44:07.1911855Z 2025-05-07T19:44:07.1911869Z 2025-05-07T19:44:07.1912126Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:07.1912429Z 2025-05-07T19:44:07.1912433Z 2025-05-07T19:44:07.1912437Z 2025-05-07T19:44:07.1912454Z 2025-05-07T19:44:07.1912457Z 2025-05-07T19:44:07.1912461Z 2025-05-07T19:44:07.1912464Z 2025-05-07T19:44:07.1912468Z 2025-05-07T19:44:07.1913816Z 2025-05-07T19:44:07.1915998Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:07.1916321Z 2025-05-07T19:44:07.1916326Z 2025-05-07T19:44:07.1916329Z 2025-05-07T19:44:07.1916369Z 2025-05-07T19:44:07.1922466Z cffi-1.17.1 | 295 KB | ########## | 100%  2025-05-07T19:44:07.1922726Z 2025-05-07T19:44:07.1922730Z 2025-05-07T19:44:07.1922734Z 2025-05-07T19:44:07.1922741Z 2025-05-07T19:44:07.1929517Z cffi-1.17.1 | 295 KB | ########## | 100%  2025-05-07T19:44:07.1929771Z 2025-05-07T19:44:07.1929783Z 2025-05-07T19:44:07.1929787Z 2025-05-07T19:44:07.1929791Z 2025-05-07T19:44:07.1929794Z 2025-05-07T19:44:07.1929797Z 2025-05-07T19:44:07.1929801Z 2025-05-07T19:44:07.1929804Z 2025-05-07T19:44:07.1929808Z 2025-05-07T19:44:07.1933543Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.1933841Z 2025-05-07T19:44:07.1933850Z 2025-05-07T19:44:07.1933854Z 2025-05-07T19:44:07.1933858Z 2025-05-07T19:44:07.1933861Z 2025-05-07T19:44:07.1933864Z 2025-05-07T19:44:07.1933868Z 2025-05-07T19:44:07.1933871Z 2025-05-07T19:44:07.1943141Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:07.1943463Z 2025-05-07T19:44:07.1943467Z 2025-05-07T19:44:07.1943470Z 2025-05-07T19:44:07.1943473Z 2025-05-07T19:44:07.1943477Z 2025-05-07T19:44:07.1944645Z 2025-05-07T19:44:07.1963089Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.1963410Z 2025-05-07T19:44:07.1963415Z 2025-05-07T19:44:07.1963418Z 2025-05-07T19:44:07.1963422Z 2025-05-07T19:44:07.1963425Z 2025-05-07T19:44:07.1963429Z 2025-05-07T19:44:07.1963432Z 2025-05-07T19:44:07.1963436Z 2025-05-07T19:44:07.2006161Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.2006710Z 2025-05-07T19:44:07.2006715Z 2025-05-07T19:44:07.2006719Z 2025-05-07T19:44:07.2006722Z 2025-05-07T19:44:07.2006726Z 2025-05-07T19:44:07.2040255Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.2040592Z 2025-05-07T19:44:07.2040597Z 2025-05-07T19:44:07.2040601Z 2025-05-07T19:44:07.2040618Z 2025-05-07T19:44:07.2040622Z 2025-05-07T19:44:07.2040626Z 2025-05-07T19:44:07.2040629Z 2025-05-07T19:44:07.2040633Z 2025-05-07T19:44:07.2040636Z 2025-05-07T19:44:07.2040640Z 2025-05-07T19:44:07.2051430Z python_abi-3.11 | 5 KB | ########## | 100%  2025-05-07T19:44:07.2051775Z 2025-05-07T19:44:07.2051787Z 2025-05-07T19:44:07.2051791Z 2025-05-07T19:44:07.2051795Z 2025-05-07T19:44:07.2051800Z 2025-05-07T19:44:07.2051804Z 2025-05-07T19:44:07.2051809Z 2025-05-07T19:44:07.2051814Z 2025-05-07T19:44:07.2051818Z 2025-05-07T19:44:07.2051823Z 2025-05-07T19:44:07.2508483Z python_abi-3.11 | 5 KB | ########## | 100%  2025-05-07T19:44:07.2508833Z 2025-05-07T19:44:07.2508838Z 2025-05-07T19:44:07.2599878Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.2600191Z 2025-05-07T19:44:07.2600196Z 2025-05-07T19:44:07.2600200Z 2025-05-07T19:44:07.2600204Z 2025-05-07T19:44:07.2600208Z 2025-05-07T19:44:07.2600226Z 2025-05-07T19:44:07.2600230Z 2025-05-07T19:44:07.2886527Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.2886982Z 2025-05-07T19:44:07.2886987Z 2025-05-07T19:44:07.2886990Z 2025-05-07T19:44:07.2886994Z 2025-05-07T19:44:07.2886997Z 2025-05-07T19:44:07.2887001Z 2025-05-07T19:44:07.2887004Z 2025-05-07T19:44:07.2887008Z 2025-05-07T19:44:07.2887012Z 2025-05-07T19:44:07.3596002Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.3596323Z 2025-05-07T19:44:07.3596331Z 2025-05-07T19:44:07.3596335Z 2025-05-07T19:44:07.3596338Z 2025-05-07T19:44:07.3596365Z 2025-05-07T19:44:07.3596368Z 2025-05-07T19:44:07.3598911Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.3599224Z 2025-05-07T19:44:07.3599228Z 2025-05-07T19:44:07.3599232Z 2025-05-07T19:44:07.3599241Z 2025-05-07T19:44:07.3599245Z 2025-05-07T19:44:07.3599249Z 2025-05-07T19:44:07.3640367Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.3730363Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.3730622Z 2025-05-07T19:44:07.3730627Z 2025-05-07T19:44:07.3730630Z 2025-05-07T19:44:07.3730634Z 2025-05-07T19:44:07.3730657Z 2025-05-07T19:44:07.3730661Z 2025-05-07T19:44:07.3730664Z 2025-05-07T19:44:07.3730667Z 2025-05-07T19:44:07.3730671Z 2025-05-07T19:44:07.3730674Z 2025-05-07T19:44:07.3732953Z python_abi-3.11 | 5 KB | ########## | 100%  2025-05-07T19:44:07.3733261Z 2025-05-07T19:44:07.3733265Z 2025-05-07T19:44:07.3733298Z 2025-05-07T19:44:07.3733302Z 2025-05-07T19:44:07.3733305Z 2025-05-07T19:44:07.3733309Z 2025-05-07T19:44:07.3733312Z 2025-05-07T19:44:07.3733316Z 2025-05-07T19:44:07.3735215Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.3735536Z 2025-05-07T19:44:07.3735540Z 2025-05-07T19:44:07.3735560Z 2025-05-07T19:44:07.3735745Z 2025-05-07T19:44:07.3735750Z 2025-05-07T19:44:07.3735754Z 2025-05-07T19:44:07.3735757Z 2025-05-07T19:44:07.3735760Z 2025-05-07T19:44:07.3760247Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.3760582Z 2025-05-07T19:44:07.3768149Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.3768574Z 2025-05-07T19:44:07.3768805Z 2025-05-07T19:44:07.3768970Z  2025-05-07T19:44:07.3769182Z 2025-05-07T19:44:07.3769186Z 2025-05-07T19:44:07.3769370Z  2025-05-07T19:44:07.3769731Z 2025-05-07T19:44:07.3769734Z 2025-05-07T19:44:07.3769738Z 2025-05-07T19:44:07.3769944Z  2025-05-07T19:44:07.3770164Z 2025-05-07T19:44:07.3770167Z 2025-05-07T19:44:07.3770171Z 2025-05-07T19:44:07.3770174Z 2025-05-07T19:44:07.3770356Z  2025-05-07T19:44:07.3770599Z 2025-05-07T19:44:07.3770602Z 2025-05-07T19:44:07.3770606Z 2025-05-07T19:44:07.3770609Z 2025-05-07T19:44:07.3770613Z 2025-05-07T19:44:07.3770789Z  2025-05-07T19:44:07.3771016Z 2025-05-07T19:44:07.3771020Z 2025-05-07T19:44:07.3771023Z 2025-05-07T19:44:07.3771027Z 2025-05-07T19:44:07.3771044Z 2025-05-07T19:44:07.3771047Z 2025-05-07T19:44:07.3771226Z  2025-05-07T19:44:07.3771455Z 2025-05-07T19:44:07.3771464Z 2025-05-07T19:44:07.3771467Z 2025-05-07T19:44:07.3771471Z 2025-05-07T19:44:07.3771474Z 2025-05-07T19:44:07.3771478Z 2025-05-07T19:44:07.3771481Z 2025-05-07T19:44:07.3771681Z  2025-05-07T19:44:07.3771913Z 2025-05-07T19:44:07.3771917Z 2025-05-07T19:44:07.3771920Z 2025-05-07T19:44:07.3771929Z 2025-05-07T19:44:07.3771932Z 2025-05-07T19:44:07.3771936Z 2025-05-07T19:44:07.3771939Z 2025-05-07T19:44:07.3771942Z 2025-05-07T19:44:07.3772145Z  2025-05-07T19:44:07.3772376Z 2025-05-07T19:44:07.3772379Z 2025-05-07T19:44:07.3772383Z 2025-05-07T19:44:07.3772386Z 2025-05-07T19:44:07.3772389Z 2025-05-07T19:44:07.3772393Z 2025-05-07T19:44:07.3772396Z 2025-05-07T19:44:07.3772399Z 2025-05-07T19:44:07.3772403Z 2025-05-07T19:44:07.3772608Z  2025-05-07T19:44:07.3772844Z 2025-05-07T19:44:07.3772851Z 2025-05-07T19:44:07.3772855Z 2025-05-07T19:44:07.3772858Z 2025-05-07T19:44:07.3772862Z 2025-05-07T19:44:07.3772865Z 2025-05-07T19:44:07.3772869Z 2025-05-07T19:44:07.3772872Z 2025-05-07T19:44:07.3772875Z 2025-05-07T19:44:07.3772879Z 2025-05-07T19:44:07.3773099Z  done 2025-05-07T19:44:07.4782608Z Preparing transaction: - done 2025-05-07T19:44:07.5791704Z Verifying transaction: | done 2025-05-07T19:44:08.9820496Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:09.0818281Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:10.7963033Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:10.7976925Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:10.8004811Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:11.4672907Z Channels: 2025-05-07T19:44:11.4673649Z - conda-forge 2025-05-07T19:44:11.4674317Z Platform: linux-64 2025-05-07T19:44:14.5873520Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:15.0164396Z Solving environment: \ done 2025-05-07T19:44:15.0635041Z 2025-05-07T19:44:15.0635689Z ## Package Plan ## 2025-05-07T19:44:15.0635900Z 2025-05-07T19:44:15.0636619Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:15.0636977Z 2025-05-07T19:44:15.0637097Z added / updated specs: 2025-05-07T19:44:15.0637426Z - libxcrypt 2025-05-07T19:44:15.0637578Z 2025-05-07T19:44:15.0637583Z 2025-05-07T19:44:15.0637720Z The following packages will be downloaded: 2025-05-07T19:44:15.0637995Z 2025-05-07T19:44:15.0638123Z package | build 2025-05-07T19:44:15.0638492Z ---------------------------|----------------- 2025-05-07T19:44:15.0638943Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:15.0639433Z ------------------------------------------------------------ 2025-05-07T19:44:15.0639989Z Total: 98 KB 2025-05-07T19:44:15.0640223Z 2025-05-07T19:44:15.0640403Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:15.0640675Z 2025-05-07T19:44:15.0640931Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:15.0641289Z 2025-05-07T19:44:15.0641293Z 2025-05-07T19:44:15.0641297Z 2025-05-07T19:44:15.0641461Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:15.2291060Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:15.2307295Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:15.2412063Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:15.2413330Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:15.2414359Z 2025-05-07T19:44:15.2415178Z done 2025-05-07T19:44:15.3421984Z Preparing transaction: / done 2025-05-07T19:44:15.4431409Z Verifying transaction: \ done 2025-05-07T19:44:15.5441180Z Executing transaction: / done 2025-05-07T19:44:18.8467770Z [SETUP] Copying over ... 2025-05-07T19:44:18.8469090Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.11/crypt.h 2025-05-07T19:44:18.8469855Z 2025-05-07T19:44:18.8500430Z 2025-05-07T19:44:20.4473795Z [SETUP] Installed Python version: Python 3.11.11 2025-05-07T19:44:20.4474330Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:20.4548676Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:20.4549198Z . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:20.4550151Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:20.4550576Z env: 2025-05-07T19:44:20.4550837Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:20.4551213Z BUILD_ENV: build_binary 2025-05-07T19:44:20.4551520Z BUILD_TARGET: default 2025-05-07T19:44:20.4551812Z BUILD_VARIANT: cuda 2025-05-07T19:44:20.4552077Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:20.4552387Z ##[endgroup] 2025-05-07T19:44:20.8947535Z ################################################################################ 2025-05-07T19:44:20.8948660Z # Install C/C++ Compilers 2025-05-07T19:44:20.8949618Z # 2025-05-07T19:44:20.8960424Z # [2025-05-07T19:44:20.895Z] + install_cxx_compiler build_binary gcc 2025-05-07T19:44:20.8961001Z ################################################################################ 2025-05-07T19:44:20.8961254Z 2025-05-07T19:44:20.8978992Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:20.9818693Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:20.9822617Z [INSTALL] Installing GLIBC (architecture = 64) ... 2025-05-07T19:44:20.9847466Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:21.6489101Z Channels: 2025-05-07T19:44:21.6489668Z - conda-forge 2025-05-07T19:44:21.6489929Z Platform: linux-64 2025-05-07T19:44:24.7466906Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:25.1683692Z Solving environment: \ done 2025-05-07T19:44:25.2148379Z 2025-05-07T19:44:25.2148919Z ## Package Plan ## 2025-05-07T19:44:25.2149660Z 2025-05-07T19:44:25.2150273Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:25.2150869Z 2025-05-07T19:44:25.2151009Z added / updated specs: 2025-05-07T19:44:25.2151320Z - sysroot_linux-64=2.17 2025-05-07T19:44:25.2151541Z 2025-05-07T19:44:25.2151545Z 2025-05-07T19:44:25.2151684Z The following packages will be downloaded: 2025-05-07T19:44:25.2151921Z 2025-05-07T19:44:25.2152077Z package | build 2025-05-07T19:44:25.2152429Z ---------------------------|----------------- 2025-05-07T19:44:25.2152920Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:25.2153859Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:25.2154314Z ------------------------------------------------------------ 2025-05-07T19:44:25.2154667Z Total: 15.4 MB 2025-05-07T19:44:25.2154909Z 2025-05-07T19:44:25.2155042Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:25.2155271Z 2025-05-07T19:44:25.2155592Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:25.2156179Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:25.2156526Z 2025-05-07T19:44:25.2156529Z 2025-05-07T19:44:25.2156532Z 2025-05-07T19:44:25.2156690Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:25.2157077Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.2157342Z 2025-05-07T19:44:25.4292534Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:25.4293463Z 2025-05-07T19:44:25.4403328Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:25.4403806Z 2025-05-07T19:44:25.4459513Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.5473114Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.6097863Z sysroot_linux-64-2.1 | 14.5 MB | ####2 | 42% 2025-05-07T19:44:25.6098505Z 2025-05-07T19:44:25.6099309Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.6099629Z 2025-05-07T19:44:25.6571227Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.6571776Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:26.0926381Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:26.0927595Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:26.0928643Z 2025-05-07T19:44:26.0929306Z 2025-05-07T19:44:26.0929988Z  done 2025-05-07T19:44:26.1938793Z Preparing transaction: / done 2025-05-07T19:44:26.3950568Z Verifying transaction: \ | done 2025-05-07T19:44:26.4962705Z Executing transaction: - done 2025-05-07T19:44:26.5793406Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:26.5794293Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:28.2217827Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:28.2223302Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 2025-05-07T19:44:28.2249603Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:28.9212881Z Channels: 2025-05-07T19:44:28.9213347Z - conda-forge 2025-05-07T19:44:28.9214201Z Platform: linux-64 2025-05-07T19:44:31.9787307Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:33.1045579Z Solving environment: \ | / done 2025-05-07T19:44:33.1548150Z 2025-05-07T19:44:33.1548583Z ## Package Plan ## 2025-05-07T19:44:33.1548879Z 2025-05-07T19:44:33.1549211Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:33.1549741Z 2025-05-07T19:44:33.1549881Z added / updated specs: 2025-05-07T19:44:33.1550185Z - gxx_linux-64=11.4.0 2025-05-07T19:44:33.1550364Z 2025-05-07T19:44:33.1550445Z 2025-05-07T19:44:33.1550613Z The following packages will be downloaded: 2025-05-07T19:44:33.1550862Z 2025-05-07T19:44:33.1551011Z package | build 2025-05-07T19:44:33.1551408Z ---------------------------|----------------- 2025-05-07T19:44:33.1551862Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:33.1552431Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:33.1552973Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:33.1553800Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:33.1554530Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:33.1555023Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:33.1555584Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:33.1556103Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:33.1556657Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:33.1557175Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:33.1557702Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:33.1558254Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:33.1558704Z ------------------------------------------------------------ 2025-05-07T19:44:33.1559111Z Total: 91.6 MB 2025-05-07T19:44:33.1559348Z 2025-05-07T19:44:33.1559521Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:33.1559766Z 2025-05-07T19:44:33.1560059Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:33.1560712Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:33.1561488Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:33.1562091Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:33.1562685Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:33.1563254Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:33.1563877Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.1564505Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:33.1565085Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:33.1565720Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.1566123Z 2025-05-07T19:44:33.1566256Z The following packages will be UPDATED: 2025-05-07T19:44:33.1566509Z 2025-05-07T19:44:33.1566858Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:33.1567659Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:33.1568139Z 2025-05-07T19:44:33.1568143Z 2025-05-07T19:44:33.1568147Z 2025-05-07T19:44:33.1568309Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:33.1568744Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:33.1569003Z 2025-05-07T19:44:33.1569344Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:33.1569642Z 2025-05-07T19:44:33.1569646Z 2025-05-07T19:44:33.1569888Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:33.1570176Z 2025-05-07T19:44:33.1570180Z 2025-05-07T19:44:33.1570184Z 2025-05-07T19:44:33.1570459Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:33.1570746Z 2025-05-07T19:44:33.1570750Z 2025-05-07T19:44:33.1570753Z 2025-05-07T19:44:33.1570756Z 2025-05-07T19:44:33.1583823Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:33.1584233Z 2025-05-07T19:44:33.1584238Z 2025-05-07T19:44:33.1584242Z 2025-05-07T19:44:33.1584246Z 2025-05-07T19:44:33.1584249Z 2025-05-07T19:44:33.1584796Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:33.1585142Z 2025-05-07T19:44:33.1585147Z 2025-05-07T19:44:33.1585151Z 2025-05-07T19:44:33.1585154Z 2025-05-07T19:44:33.1585386Z 2025-05-07T19:44:33.1585390Z 2025-05-07T19:44:33.1585701Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:33.1586046Z 2025-05-07T19:44:33.1586049Z 2025-05-07T19:44:33.1586053Z 2025-05-07T19:44:33.1586056Z 2025-05-07T19:44:33.1586061Z 2025-05-07T19:44:33.1586064Z 2025-05-07T19:44:33.1588402Z 2025-05-07T19:44:33.1590061Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:33.1590417Z 2025-05-07T19:44:33.1590421Z 2025-05-07T19:44:33.1590426Z 2025-05-07T19:44:33.1590430Z 2025-05-07T19:44:33.1590434Z 2025-05-07T19:44:33.1590486Z 2025-05-07T19:44:33.1590490Z 2025-05-07T19:44:33.1590507Z 2025-05-07T19:44:33.1599124Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:33.1600126Z 2025-05-07T19:44:33.1600140Z 2025-05-07T19:44:33.1600151Z 2025-05-07T19:44:33.1600192Z 2025-05-07T19:44:33.1600203Z 2025-05-07T19:44:33.1600214Z 2025-05-07T19:44:33.1600224Z 2025-05-07T19:44:33.1600268Z 2025-05-07T19:44:33.1600279Z 2025-05-07T19:44:33.1601056Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:33.1601926Z 2025-05-07T19:44:33.1601938Z 2025-05-07T19:44:33.1601948Z 2025-05-07T19:44:33.1601959Z 2025-05-07T19:44:33.1601999Z 2025-05-07T19:44:33.1602010Z 2025-05-07T19:44:33.1602021Z 2025-05-07T19:44:33.1602032Z 2025-05-07T19:44:33.1602042Z 2025-05-07T19:44:33.1602054Z 2025-05-07T19:44:33.1602814Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:33.1603684Z 2025-05-07T19:44:33.1604094Z 2025-05-07T19:44:33.1604108Z 2025-05-07T19:44:33.1604118Z 2025-05-07T19:44:33.1604160Z 2025-05-07T19:44:33.1604171Z 2025-05-07T19:44:33.1604182Z 2025-05-07T19:44:33.1604192Z 2025-05-07T19:44:33.1604202Z 2025-05-07T19:44:33.1604213Z 2025-05-07T19:44:33.1604223Z 2025-05-07T19:44:33.5248452Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:33.5249510Z 2025-05-07T19:44:33.5249524Z 2025-05-07T19:44:33.5453470Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:33.5454404Z 2025-05-07T19:44:33.5741705Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:33.5742020Z 2025-05-07T19:44:33.5742025Z 2025-05-07T19:44:33.5742058Z 2025-05-07T19:44:33.5742062Z 2025-05-07T19:44:33.5768242Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:33.5768942Z 2025-05-07T19:44:33.5768946Z 2025-05-07T19:44:33.5768949Z 2025-05-07T19:44:33.5889968Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:33.6247951Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:33.6248885Z 2025-05-07T19:44:33.6248890Z 2025-05-07T19:44:33.6433511Z libstdcxx-devel_linu | 11.1 MB | #######1 | 72%  2025-05-07T19:44:33.6433858Z 2025-05-07T19:44:33.6433863Z 2025-05-07T19:44:33.6433866Z 2025-05-07T19:44:33.6433880Z 2025-05-07T19:44:33.6639757Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.6640216Z 2025-05-07T19:44:33.6640221Z 2025-05-07T19:44:33.6640225Z 2025-05-07T19:44:33.6859988Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:33.6860412Z 2025-05-07T19:44:33.6860564Z 2025-05-07T19:44:33.6860569Z 2025-05-07T19:44:33.6860589Z 2025-05-07T19:44:33.6860654Z 2025-05-07T19:44:33.6891712Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:33.6933157Z gcc_impl_linux-64-11 | 53.0 MB | 8 | 9% 2025-05-07T19:44:33.6933620Z 2025-05-07T19:44:33.6934026Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:33.6934295Z 2025-05-07T19:44:33.6998725Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:33.6999022Z 2025-05-07T19:44:33.6999038Z 2025-05-07T19:44:33.6999042Z 2025-05-07T19:44:33.6999046Z 2025-05-07T19:44:33.6999050Z 2025-05-07T19:44:33.6999053Z 2025-05-07T19:44:33.7328384Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:33.7328737Z 2025-05-07T19:44:33.7328742Z 2025-05-07T19:44:33.7328746Z 2025-05-07T19:44:33.7328749Z 2025-05-07T19:44:33.7328753Z 2025-05-07T19:44:33.7328756Z 2025-05-07T19:44:33.7328759Z 2025-05-07T19:44:33.7555815Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:33.7556160Z 2025-05-07T19:44:33.7556164Z 2025-05-07T19:44:33.7556168Z 2025-05-07T19:44:33.7556172Z 2025-05-07T19:44:33.7556175Z 2025-05-07T19:44:33.7556179Z 2025-05-07T19:44:33.7556182Z 2025-05-07T19:44:33.7753594Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:33.7753961Z 2025-05-07T19:44:33.7753966Z 2025-05-07T19:44:33.7753970Z 2025-05-07T19:44:33.7753974Z 2025-05-07T19:44:33.7753977Z 2025-05-07T19:44:33.7753981Z 2025-05-07T19:44:33.7853526Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:33.7853909Z 2025-05-07T19:44:33.7853935Z 2025-05-07T19:44:33.7853939Z 2025-05-07T19:44:33.7853942Z 2025-05-07T19:44:33.7853946Z 2025-05-07T19:44:33.7891929Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:33.8088728Z gcc_impl_linux-64-11 | 53.0 MB | ##1 | 21% 2025-05-07T19:44:33.8089101Z 2025-05-07T19:44:33.8089194Z 2025-05-07T19:44:33.8089199Z 2025-05-07T19:44:33.8089222Z 2025-05-07T19:44:33.8089226Z 2025-05-07T19:44:33.8089399Z 2025-05-07T19:44:33.8089440Z 2025-05-07T19:44:33.8089468Z 2025-05-07T19:44:33.8089471Z 2025-05-07T19:44:33.8093481Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:33.8093884Z 2025-05-07T19:44:33.8093890Z 2025-05-07T19:44:33.8093893Z 2025-05-07T19:44:33.8093909Z 2025-05-07T19:44:33.8097813Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.8098109Z 2025-05-07T19:44:33.8098116Z 2025-05-07T19:44:33.8098122Z 2025-05-07T19:44:33.8098130Z 2025-05-07T19:44:33.8102993Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:33.8103269Z 2025-05-07T19:44:33.8103273Z 2025-05-07T19:44:33.8103277Z 2025-05-07T19:44:33.8103290Z 2025-05-07T19:44:33.8103294Z 2025-05-07T19:44:33.8103297Z 2025-05-07T19:44:33.8103301Z 2025-05-07T19:44:33.8103304Z 2025-05-07T19:44:33.8103308Z 2025-05-07T19:44:33.8129878Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:33.8130206Z 2025-05-07T19:44:33.8130211Z 2025-05-07T19:44:33.8130214Z 2025-05-07T19:44:33.8130218Z 2025-05-07T19:44:33.8130233Z 2025-05-07T19:44:33.8130237Z 2025-05-07T19:44:33.8130241Z 2025-05-07T19:44:33.8130244Z 2025-05-07T19:44:33.8148332Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:33.8148741Z 2025-05-07T19:44:33.8148746Z 2025-05-07T19:44:33.8148750Z 2025-05-07T19:44:33.8148775Z 2025-05-07T19:44:33.8148778Z 2025-05-07T19:44:33.8148784Z 2025-05-07T19:44:33.8148789Z 2025-05-07T19:44:33.8148817Z 2025-05-07T19:44:33.8163307Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:33.8163623Z 2025-05-07T19:44:33.8163804Z 2025-05-07T19:44:33.8164037Z 2025-05-07T19:44:33.8164052Z 2025-05-07T19:44:33.8164060Z 2025-05-07T19:44:33.8164068Z 2025-05-07T19:44:33.8164104Z 2025-05-07T19:44:33.8164110Z 2025-05-07T19:44:33.8164116Z 2025-05-07T19:44:33.8164128Z 2025-05-07T19:44:33.8172532Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:33.8172882Z 2025-05-07T19:44:33.8172888Z 2025-05-07T19:44:33.8172913Z 2025-05-07T19:44:33.8172917Z 2025-05-07T19:44:33.8172921Z 2025-05-07T19:44:33.8172946Z 2025-05-07T19:44:33.8172950Z 2025-05-07T19:44:33.8172954Z 2025-05-07T19:44:33.8172957Z 2025-05-07T19:44:33.8172968Z 2025-05-07T19:44:33.8188633Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:33.8188947Z 2025-05-07T19:44:33.8189044Z 2025-05-07T19:44:33.8453618Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:33.8454029Z 2025-05-07T19:44:33.8454036Z 2025-05-07T19:44:33.8454057Z 2025-05-07T19:44:33.8454063Z 2025-05-07T19:44:33.8454070Z 2025-05-07T19:44:33.8454077Z 2025-05-07T19:44:33.8454084Z 2025-05-07T19:44:33.8454090Z 2025-05-07T19:44:33.8454097Z 2025-05-07T19:44:33.8454102Z 2025-05-07T19:44:33.8454107Z 2025-05-07T19:44:33.8461388Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:33.8461734Z 2025-05-07T19:44:33.8461738Z 2025-05-07T19:44:33.8461776Z 2025-05-07T19:44:33.8461789Z 2025-05-07T19:44:33.8461794Z 2025-05-07T19:44:33.8461798Z 2025-05-07T19:44:33.8461801Z 2025-05-07T19:44:33.8461805Z 2025-05-07T19:44:33.8461808Z 2025-05-07T19:44:33.8461812Z 2025-05-07T19:44:33.8461815Z 2025-05-07T19:44:33.8504270Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:33.8504640Z 2025-05-07T19:44:33.8504679Z 2025-05-07T19:44:33.8504683Z 2025-05-07T19:44:33.8504687Z 2025-05-07T19:44:33.8504690Z 2025-05-07T19:44:33.8504694Z 2025-05-07T19:44:33.8504697Z 2025-05-07T19:44:33.8505042Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:33.8505356Z 2025-05-07T19:44:33.8505359Z 2025-05-07T19:44:33.8505363Z 2025-05-07T19:44:33.8505366Z 2025-05-07T19:44:33.8505370Z 2025-05-07T19:44:33.8505373Z 2025-05-07T19:44:33.8505377Z 2025-05-07T19:44:33.8894711Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:33.9481900Z gcc_impl_linux-64-11 | 53.0 MB | ####1 | 42% 2025-05-07T19:44:33.9482232Z 2025-05-07T19:44:33.9482237Z 2025-05-07T19:44:33.9482241Z 2025-05-07T19:44:33.9482245Z 2025-05-07T19:44:33.9482248Z 2025-05-07T19:44:33.9482264Z 2025-05-07T19:44:33.9483665Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:33.9483999Z 2025-05-07T19:44:33.9484004Z 2025-05-07T19:44:33.9484021Z 2025-05-07T19:44:33.9484028Z 2025-05-07T19:44:33.9484032Z 2025-05-07T19:44:33.9484035Z 2025-05-07T19:44:33.9895124Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:34.0445311Z gcc_impl_linux-64-11 | 53.0 MB | ######2 | 63% 2025-05-07T19:44:34.0445626Z 2025-05-07T19:44:34.0445631Z 2025-05-07T19:44:34.0445635Z 2025-05-07T19:44:34.0446092Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:34.0446402Z 2025-05-07T19:44:34.0446406Z 2025-05-07T19:44:34.0446410Z 2025-05-07T19:44:34.0481945Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:34.0482265Z 2025-05-07T19:44:34.0718055Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:34.0718401Z 2025-05-07T19:44:34.0718406Z 2025-05-07T19:44:34.0718415Z 2025-05-07T19:44:34.0718418Z 2025-05-07T19:44:34.0718422Z 2025-05-07T19:44:34.0718425Z 2025-05-07T19:44:34.0718429Z 2025-05-07T19:44:34.0718432Z 2025-05-07T19:44:34.0718465Z 2025-05-07T19:44:34.0721607Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:34.0721959Z 2025-05-07T19:44:34.0721964Z 2025-05-07T19:44:34.0721976Z 2025-05-07T19:44:34.0721980Z 2025-05-07T19:44:34.0721984Z 2025-05-07T19:44:34.0721987Z 2025-05-07T19:44:34.0721990Z 2025-05-07T19:44:34.0721994Z 2025-05-07T19:44:34.0721997Z 2025-05-07T19:44:34.0877495Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:34.0877844Z 2025-05-07T19:44:34.0877849Z 2025-05-07T19:44:34.0877853Z 2025-05-07T19:44:34.0877870Z 2025-05-07T19:44:34.0877874Z 2025-05-07T19:44:34.0877878Z 2025-05-07T19:44:34.0877881Z 2025-05-07T19:44:34.0877885Z 2025-05-07T19:44:34.0879059Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:34.0879377Z 2025-05-07T19:44:34.0879391Z 2025-05-07T19:44:34.0879394Z 2025-05-07T19:44:34.0879398Z 2025-05-07T19:44:34.0879401Z 2025-05-07T19:44:34.0879608Z 2025-05-07T19:44:34.0879612Z 2025-05-07T19:44:34.0879616Z 2025-05-07T19:44:34.0893762Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:34.1048769Z gcc_impl_linux-64-11 | 53.0 MB | #######9 | 80% 2025-05-07T19:44:34.1049115Z 2025-05-07T19:44:34.1049373Z 2025-05-07T19:44:34.1049382Z 2025-05-07T19:44:34.1049387Z 2025-05-07T19:44:34.1049393Z 2025-05-07T19:44:34.1049880Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:34.1050213Z 2025-05-07T19:44:34.1050217Z 2025-05-07T19:44:34.1050221Z 2025-05-07T19:44:34.1050240Z 2025-05-07T19:44:34.1050243Z 2025-05-07T19:44:34.1074894Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:34.1075248Z 2025-05-07T19:44:34.1075253Z 2025-05-07T19:44:34.1075257Z 2025-05-07T19:44:34.1075261Z 2025-05-07T19:44:34.1075264Z 2025-05-07T19:44:34.1075268Z 2025-05-07T19:44:34.1075271Z 2025-05-07T19:44:34.1075275Z 2025-05-07T19:44:34.1075301Z 2025-05-07T19:44:34.1075318Z 2025-05-07T19:44:34.1077069Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:34.1077380Z 2025-05-07T19:44:34.1077397Z 2025-05-07T19:44:34.1077400Z 2025-05-07T19:44:34.1077404Z 2025-05-07T19:44:34.1077407Z 2025-05-07T19:44:34.1077433Z 2025-05-07T19:44:34.1077437Z 2025-05-07T19:44:34.1077440Z 2025-05-07T19:44:34.1077444Z 2025-05-07T19:44:34.1077447Z 2025-05-07T19:44:34.1341977Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:34.1342366Z 2025-05-07T19:44:34.1342580Z 2025-05-07T19:44:34.1342613Z 2025-05-07T19:44:34.1342616Z 2025-05-07T19:44:34.1342620Z 2025-05-07T19:44:34.1342623Z 2025-05-07T19:44:34.1342627Z 2025-05-07T19:44:34.1342630Z 2025-05-07T19:44:34.1342634Z 2025-05-07T19:44:34.1342637Z 2025-05-07T19:44:34.1342641Z 2025-05-07T19:44:34.1342951Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:34.1343290Z 2025-05-07T19:44:34.1343320Z 2025-05-07T19:44:34.1343324Z 2025-05-07T19:44:34.1343327Z 2025-05-07T19:44:34.1343330Z 2025-05-07T19:44:34.1343334Z 2025-05-07T19:44:34.1343337Z 2025-05-07T19:44:34.1343340Z 2025-05-07T19:44:34.1343344Z 2025-05-07T19:44:34.1343347Z 2025-05-07T19:44:34.1343351Z 2025-05-07T19:44:34.3665017Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:34.3665556Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:34.4572607Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:34.4573172Z 2025-05-07T19:44:34.4573177Z 2025-05-07T19:44:34.9401028Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:34.9405448Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:34.9405847Z 2025-05-07T19:44:34.9406122Z 2025-05-07T19:44:34.9406371Z  2025-05-07T19:44:34.9406632Z 2025-05-07T19:44:34.9406654Z 2025-05-07T19:44:34.9406852Z  2025-05-07T19:44:34.9407081Z 2025-05-07T19:44:34.9407085Z 2025-05-07T19:44:34.9407091Z 2025-05-07T19:44:34.9407262Z  2025-05-07T19:44:34.9407504Z 2025-05-07T19:44:34.9407509Z 2025-05-07T19:44:34.9407551Z 2025-05-07T19:44:34.9407557Z 2025-05-07T19:44:34.9407735Z  2025-05-07T19:44:34.9407974Z 2025-05-07T19:44:34.9407977Z 2025-05-07T19:44:34.9408001Z 2025-05-07T19:44:34.9408005Z 2025-05-07T19:44:34.9408008Z 2025-05-07T19:44:34.9408187Z  2025-05-07T19:44:34.9408415Z 2025-05-07T19:44:34.9408418Z 2025-05-07T19:44:34.9408421Z 2025-05-07T19:44:34.9408442Z 2025-05-07T19:44:34.9408446Z 2025-05-07T19:44:34.9408449Z 2025-05-07T19:44:34.9408636Z  2025-05-07T19:44:34.9409114Z 2025-05-07T19:44:34.9409117Z 2025-05-07T19:44:34.9409121Z 2025-05-07T19:44:34.9409124Z 2025-05-07T19:44:34.9409128Z 2025-05-07T19:44:34.9409131Z 2025-05-07T19:44:34.9409135Z 2025-05-07T19:44:34.9409343Z  2025-05-07T19:44:34.9409574Z 2025-05-07T19:44:34.9409577Z 2025-05-07T19:44:34.9409581Z 2025-05-07T19:44:34.9409584Z 2025-05-07T19:44:34.9409588Z 2025-05-07T19:44:34.9409591Z 2025-05-07T19:44:34.9409594Z 2025-05-07T19:44:34.9409598Z 2025-05-07T19:44:34.9409813Z  2025-05-07T19:44:34.9410045Z 2025-05-07T19:44:34.9410049Z 2025-05-07T19:44:34.9410052Z 2025-05-07T19:44:34.9410056Z 2025-05-07T19:44:34.9410059Z 2025-05-07T19:44:34.9410063Z 2025-05-07T19:44:34.9410066Z 2025-05-07T19:44:34.9410070Z 2025-05-07T19:44:34.9410073Z 2025-05-07T19:44:34.9410294Z  2025-05-07T19:44:34.9410533Z 2025-05-07T19:44:34.9410536Z 2025-05-07T19:44:34.9410540Z 2025-05-07T19:44:34.9410543Z 2025-05-07T19:44:34.9410547Z 2025-05-07T19:44:34.9410550Z 2025-05-07T19:44:34.9410554Z 2025-05-07T19:44:34.9410557Z 2025-05-07T19:44:34.9410561Z 2025-05-07T19:44:34.9410564Z 2025-05-07T19:44:34.9410770Z  2025-05-07T19:44:34.9411011Z 2025-05-07T19:44:34.9411015Z 2025-05-07T19:44:34.9411019Z 2025-05-07T19:44:34.9411022Z 2025-05-07T19:44:34.9411159Z 2025-05-07T19:44:34.9411163Z 2025-05-07T19:44:34.9411166Z 2025-05-07T19:44:34.9411170Z 2025-05-07T19:44:34.9411173Z 2025-05-07T19:44:34.9411177Z 2025-05-07T19:44:34.9411180Z 2025-05-07T19:44:34.9411416Z  done 2025-05-07T19:44:35.0418760Z Preparing transaction: \ done 2025-05-07T19:44:35.8442515Z Verifying transaction: / - \ | / - \ | done 2025-05-07T19:44:35.9462501Z Executing transaction: - done 2025-05-07T19:44:36.0346821Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:39.7104407Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:39.7105024Z 2025-05-07T19:44:39.7119818Z 2025-05-07T19:44:39.7140665Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:39.7141353Z 2025-05-07T19:44:39.7156988Z 2025-05-07T19:44:39.7181554Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:39.7182226Z 2025-05-07T19:44:39.7199299Z 2025-05-07T19:44:39.7221352Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:39.7222043Z 2025-05-07T19:44:39.7237078Z 2025-05-07T19:44:41.5066118Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:41.5066961Z 2025-05-07T19:44:41.5810999Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:43.3689387Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:43.3690276Z 2025-05-07T19:44:43.4430861Z [CHECK] Binary gcc found in PATH 2025-05-07T19:44:45.2354138Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:45.2354525Z 2025-05-07T19:44:45.3133543Z [CHECK] Binary c++ found in PATH 2025-05-07T19:44:47.0889785Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:47.0890640Z 2025-05-07T19:44:47.1456408Z [CHECK] Binary g++ found in PATH 2025-05-07T19:44:47.1457688Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:44:47.1458698Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:44:47.1458932Z 2025-05-07T19:44:48.9267178Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:48.9268212Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:48.9269897Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:48.9270670Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:48.9271684Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:48.9272913Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:48.9273289Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:48.9273642Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:48.9273946Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:48.9274220Z #define __CHAR_BIT__ 8 2025-05-07T19:44:48.9274490Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:48.9274795Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:48.9275062Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:48.9275376Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:48.9275667Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:48.9276129Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9276447Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:48.9276767Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:48.9277117Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:48.9277472Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:48.9277915Z #define __DBL_DENORM_MIN__ ((double)4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:48.9278358Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:48.9278705Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:48.9278997Z #define __GCC_IEC_559 2 2025-05-07T19:44:48.9279272Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:48.9279560Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:48.9280082Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:48.9280385Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:48.9280763Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9281104Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:48.9281414Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.9281722Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:48.9282109Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:48.9282400Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:48.9282661Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:48.9282943Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:48.9283203Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:48.9283479Z #define __INT8_C(c) c 2025-05-07T19:44:48.9283721Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:48.9284033Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9284596Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:48.9285133Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.9285544Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:48.9285843Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.9286151Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9286451Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:48.9286768Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:48.9287185Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:48.9287655Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:48.9287967Z #define __linux 1 2025-05-07T19:44:48.9288222Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:48.9288539Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:48.9288840Z #define __unix 1 2025-05-07T19:44:48.9289096Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:48.9289394Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:48.9289702Z #define __WINT_MIN__ 0U 2025-05-07T19:44:48.9289962Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.9290453Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:48.9290739Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:48.9291153Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:48.9291409Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:48.9291716Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:48.9292036Z #define __INT64_C(c) c ## L 2025-05-07T19:44:48.9292304Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:48.9292620Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:48.9293005Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:48.9293378Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:48.9293777Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:48.9294051Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:48.9294334Z #define __DBL_DIG__ 15 2025-05-07T19:44:48.9294565Z #define __FLT32_DIG__ 6 2025-05-07T19:44:48.9294885Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:48.9295239Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:48.9295505Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:48.9295830Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:48.9296188Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:48.9296431Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.9296709Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:48.9297104Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:48.9297510Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:48.9297799Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:48.9298056Z #define __unix__ 1 2025-05-07T19:44:48.9298293Z #define __INT_WIDTH__ 32 2025-05-07T19:44:48.9298534Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:48.9298793Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:48.9299041Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:48.9299323Z #define __UINT16_C(c) c 2025-05-07T19:44:48.9299558Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:48.9299826Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:48.9300292Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:48.9300665Z #define __gnu_linux__ 1 2025-05-07T19:44:48.9300924Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:48.9301195Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.9301628Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9301895Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:48.9302172Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:48.9302431Z #define __GNUC__ 11 2025-05-07T19:44:48.9302668Z #define __pie__ 2 2025-05-07T19:44:48.9302877Z #define __MMX__ 1 2025-05-07T19:44:48.9303119Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:48.9303397Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:48.9303674Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:48.9303957Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:48.9304297Z #define __DBL_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.9304711Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9305025Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.9305297Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:48.9305557Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:48.9305864Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:48.9306144Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:48.9306397Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:48.9306697Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:48.9306991Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:48.9307274Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:48.9307550Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:48.9307814Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:48.9308075Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:48.9308364Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:48.9308625Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:48.9308897Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:48.9309226Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.9309723Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:48.9310208Z #define __SSE2_MATH__ 1 2025-05-07T19:44:48.9310469Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:48.9310811Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9311124Z #define __amd64 1 2025-05-07T19:44:48.9311381Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:48.9311665Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:48.9312005Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:48.9312440Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:48.9312742Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:48.9313060Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:48.9313335Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:48.9313637Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:48.9313917Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:48.9314221Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:48.9314505Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:48.9314831Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:48.9315099Z #define __x86_64 1 2025-05-07T19:44:48.9315363Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:48.9315760Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:48.9316378Z #define __DBL_MIN__ ((double)2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:48.9316855Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:48.9317329Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.9317744Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:48.9317998Z #define __LP64__ 1 2025-05-07T19:44:48.9318248Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9318599Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:48.9319005Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:48.9319292Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:48.9319566Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.9319861Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:48.9320209Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:48.9320494Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:48.9320750Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:48.9321026Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:48.9321280Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.9321621Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:48.9321978Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:48.9322270Z #define __FLT_DIG__ 6 2025-05-07T19:44:48.9322516Z #define __NO_INLINE__ 1 2025-05-07T19:44:48.9322751Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:48.9323090Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:48.9323438Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:48.9323708Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:48.9323967Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:48.9324234Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:48.9324488Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:48.9324762Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:48.9325062Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:48.9325365Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:48.9325646Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:48.9325942Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.9326285Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:48.9326550Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:48.9326822Z #define __FLT128_DIG__ 33 2025-05-07T19:44:48.9327056Z #define __INT32_C(c) c 2025-05-07T19:44:48.9327310Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:48.9327584Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:48.9327875Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:48.9328154Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:48.9328483Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:48.9328807Z #define unix 1 2025-05-07T19:44:48.9329034Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:48.9329362Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9329661Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:48.9329982Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:48.9330312Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:48.9330576Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:48.9330835Z #define __ELF__ 1 2025-05-07T19:44:48.9331078Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:48.9331437Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:48.9331730Z #define __FLT_RADIX__ 2 2025-05-07T19:44:48.9332000Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:48.9332357Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:48.9332743Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:48.9333002Z #define __SSE_MATH__ 1 2025-05-07T19:44:48.9333247Z #define __k8 1 2025-05-07T19:44:48.9333542Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:48.9333936Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:48.9334236Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:48.9334550Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:48.9334823Z #define __LDBL_DIG__ 18 2025-05-07T19:44:48.9335059Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:48.9335331Z #define __x86_64__ 1 2025-05-07T19:44:48.9335565Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:48.9335871Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:48.9336205Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9336525Z #define __FLT64_DIG__ 15 2025-05-07T19:44:48.9336798Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9337162Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.9337476Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9337754Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:48.9338042Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9338334Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:48.9338814Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:48.9339215Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:48.9339521Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:48.9339858Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:48.9340198Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:48.9340498Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:48.9340797Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:48.9341128Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:48.9341404Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:48.9341658Z #define __SEG_FS 1 2025-05-07T19:44:48.9341883Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:48.9342171Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:48.9342443Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9342741Z #define __SEG_GS 1 2025-05-07T19:44:48.9343048Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:48.9343454Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:48.9343726Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:48.9344030Z #define __INT16_TYPE__ short int 2025-05-07T19:44:48.9344316Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:48.9344634Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:48.9344896Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:48.9345153Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:48.9345409Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:48.9345767Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.9346157Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9346460Z #define linux 1 2025-05-07T19:44:48.9346701Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9346977Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.9347280Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:48.9347535Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:48.9347807Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:48.9348068Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:48.9348430Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.9348841Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:48.9349186Z #define __code_model_small__ 1 2025-05-07T19:44:48.9349562Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:48.9350057Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:48.9350344Z #define __k8__ 1 2025-05-07T19:44:48.9350704Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:48.9351035Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:48.9351356Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:48.9351641Z #define __pic__ 2 2025-05-07T19:44:48.9351907Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9352260Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:48.9352575Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9352947Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:48.9353354Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.9353746Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:48.9354051Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:48.9354359Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.9354708Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:48.9354973Z #define __linux__ 1 2025-05-07T19:44:48.9355227Z #define __INT64_TYPE__ long int 2025-05-07T19:44:48.9355507Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:48.9355804Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:48.9356210Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:48.9356478Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:48.9356784Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9357111Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:48.9357423Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:48.9357688Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:48.9357990Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:48.9358284Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:48.9358697Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.9359055Z #define __SSE__ 1 2025-05-07T19:44:48.9359297Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:48.9359629Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.9359989Z #define __amd64__ 1 2025-05-07T19:44:48.9360228Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:48.9360478Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:48.9360760Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:48.9361027Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:48.9361316Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:48.9361605Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:48.9361866Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:48.9362156Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:48.9362423Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:48.9362790Z #define __DBL_EPSILON__ ((double)2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:48.9363267Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:48.9363651Z #define _LP64 1 2025-05-07T19:44:48.9363887Z #define __UINT8_C(c) c 2025-05-07T19:44:48.9364128Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:48.9364418Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:48.9364691Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:48.9364996Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:48.9365306Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:48.9365690Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.9366166Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.9366558Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9366878Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.9367187Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:48.9367567Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:48.9367941Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:48.9368221Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:48.9368564Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:48.9368951Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:48.9369204Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:48.9369465Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:48.9369713Z #define __FXSR__ 1 2025-05-07T19:44:48.9370101Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.9370578Z #define __DBL_NORM_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.9370990Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.9371315Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:48.9371566Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:48.9371911Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:48.9372269Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:48.9372527Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:48.9372761Z #define __PIC__ 2 2025-05-07T19:44:48.9373025Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:48.9373445Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.9373835Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:48.9374182Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.9374517Z #define __SSE2__ 1 2025-05-07T19:44:48.9374766Z #define __INT32_TYPE__ int 2025-05-07T19:44:48.9375014Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:48.9375282Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.9375610Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:48.9375982Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:48.9376266Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:48.9376532Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:48.9376814Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9377086Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:48.9377419Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:48.9377666Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:48.9377966Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9378261Z #define __PIE__ 2 2025-05-07T19:44:48.9378612Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:48.9379011Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:48.9379373Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:48.9379763Z #define __INT16_C(c) c 2025-05-07T19:44:48.9379987Z #define __STDC__ 1 2025-05-07T19:44:48.9380233Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:48.9380502Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:48.9380769Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.9381062Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:48.9381423Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:48.9381756Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:48.9382038Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.9382332Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:48.9382594Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:48.9382887Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:48.9383174Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.9383468Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:48.9383760Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.9384176Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.9384924Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:48.9385448Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:48.9385763Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:48.9386042Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:48.9386212Z 2025-05-07T19:44:48.9847260Z 2025-05-07T19:44:48.9848239Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:44:48.9849617Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:44:48.9850349Z 2025-05-07T19:44:50.7817704Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:50.7818660Z #define __cpp_attributes 200809L 2025-05-07T19:44:50.7819689Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:44:50.7820771Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:50.7821991Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:50.7822766Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:50.7823748Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:50.7824803Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:50.7825075Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:44:50.7825395Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:50.7825709Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:50.7825967Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:50.7826229Z #define __CHAR_BIT__ 8 2025-05-07T19:44:50.7826458Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:50.7826715Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:50.7826960Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:50.7827251Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:50.7827522Z #define __cpp_static_assert 201411L 2025-05-07T19:44:50.7827819Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:50.7828115Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7828432Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:50.7828733Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:50.7829063Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:50.7829513Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:50.7830115Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:50.7830583Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:50.7830920Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:50.7831238Z #define __GCC_IEC_559 2 2025-05-07T19:44:50.7831490Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:50.7831798Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:50.7832276Z #define __cpp_binary_literals 201304L 2025-05-07T19:44:50.7832587Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:50.7832911Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:44:50.7833251Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:50.7833598Z #define __cpp_variadic_templates 200704L 2025-05-07T19:44:50.7833947Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7834308Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:50.7834601Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:50.7834907Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:50.7835214Z #define __cpp_variable_templates 201304L 2025-05-07T19:44:50.7835528Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:50.7835818Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:50.7836195Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:50.7836476Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:44:50.7836799Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:44:50.7837136Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:50.7837387Z #define __INT8_C(c) c 2025-05-07T19:44:50.7837628Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:50.7837891Z #define __cpp_variadic_using 201611L 2025-05-07T19:44:50.7838217Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7838551Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:50.7838816Z #define __cpp_capture_star_this 201603L 2025-05-07T19:44:50.7839116Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:50.7839423Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:50.7839787Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:50.7840061Z #define __cpp_if_constexpr 201606L 2025-05-07T19:44:50.7840347Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:50.7840605Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7840895Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:50.7841171Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:50.7841571Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:50.7842000Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:50.7842288Z #define __linux 1 2025-05-07T19:44:50.7842525Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:50.7842797Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:50.7843083Z #define __unix 1 2025-05-07T19:44:50.7843300Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:50.7843586Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:44:50.7843952Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:50.7844239Z #define __WINT_MIN__ 0U 2025-05-07T19:44:50.7844498Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:50.7844780Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:50.7845068Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:50.7845330Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:50.7845589Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:50.7845865Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:50.7846171Z #define __INT64_C(c) c ## L 2025-05-07T19:44:50.7846434Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:50.7846742Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:50.7847010Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:50.7847324Z #define __cpp_aligned_new 201606L 2025-05-07T19:44:50.7847611Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:50.7847869Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:50.7848223Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:50.7848601Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:50.7848862Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:50.7849132Z #define __cpp_decltype_auto 201304L 2025-05-07T19:44:50.7849419Z #define __DBL_DIG__ 15 2025-05-07T19:44:50.7849642Z #define __FLT32_DIG__ 6 2025-05-07T19:44:50.7849950Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:50.7850292Z #define __GXX_WEAK__ 1 2025-05-07T19:44:50.7850533Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:50.7850788Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:50.7851176Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:50.7851539Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:50.7851791Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:50.7852097Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:44:50.7852417Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:44:50.7852833Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:50.7853232Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:50.7853523Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:50.7853788Z #define __unix__ 1 2025-05-07T19:44:50.7854005Z #define __INT_WIDTH__ 32 2025-05-07T19:44:50.7854253Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:50.7854490Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:50.7854753Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:50.7855042Z #define __UINT16_C(c) c 2025-05-07T19:44:50.7855289Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:50.7855537Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:50.7855906Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:50.7856270Z #define __gnu_linux__ 1 2025-05-07T19:44:50.7856520Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:50.7856775Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:50.7857063Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:50.7857362Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7857632Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:50.7857907Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:50.7858154Z #define __GNUC__ 11 2025-05-07T19:44:50.7858388Z #define __GXX_RTTI 1 2025-05-07T19:44:50.7858612Z #define __pie__ 2 2025-05-07T19:44:50.7858847Z #define __MMX__ 1 2025-05-07T19:44:50.7859069Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:50.7859344Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:50.7859618Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:50.7859899Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:50.7860143Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:50.7860458Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:44:50.7860789Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:50.7861136Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:50.7861531Z #define __cpp_raw_strings 200710L 2025-05-07T19:44:50.7861824Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7862146Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:50.7862463Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:50.7862733Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:50.7863030Z #define __cpp_fold_expressions 201603L 2025-05-07T19:44:50.7863332Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:50.7863606Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:50.7863856Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:50.7864145Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:50.7864430Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:50.7864704Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:50.7864981Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:50.7865241Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:50.7865496Z #define __cplusplus 201703L 2025-05-07T19:44:50.7865774Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:44:50.7866048Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:50.7866319Z #define __DEPRECATED 1 2025-05-07T19:44:50.7866586Z #define __cpp_rvalue_references 200610L 2025-05-07T19:44:50.7866875Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:50.7867143Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:50.7867443Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:50.7867808Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:50.7868072Z #define __SSE2_MATH__ 1 2025-05-07T19:44:50.7868322Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:50.7868614Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7868915Z #define __amd64 1 2025-05-07T19:44:50.7869133Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:50.7869506Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:50.7870052Z #define __GNUG__ 11 2025-05-07T19:44:50.7870323Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:50.7870675Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:50.7870944Z #define __cpp_nsdmi 200809L 2025-05-07T19:44:50.7871235Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:50.7871528Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:50.7871807Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:50.7872098Z #define __cpp_initializer_lists 200806L 2025-05-07T19:44:50.7872422Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:50.7872699Z #define __cpp_hex_float 201603L 2025-05-07T19:44:50.7872993Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:50.7873284Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:50.7873573Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:50.7873867Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:50.7874147Z #define __x86_64 1 2025-05-07T19:44:50.7874398Z #define __cpp_lambdas 200907L 2025-05-07T19:44:50.7874684Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:50.7875094Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:50.7875511Z #define __cpp_template_auto 201606L 2025-05-07T19:44:50.7875902Z #define __DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:50.7876481Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:50.7876947Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:50.7877348Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:50.7877592Z #define __LP64__ 1 2025-05-07T19:44:50.7877870Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7878209Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:50.7878597Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:50.7878862Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:50.7879147Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:50.7879428Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:50.7879686Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:50.7879952Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:50.7880206Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:50.7880541Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:50.7880894Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:50.7881176Z #define __FLT_DIG__ 6 2025-05-07T19:44:50.7881398Z #define __NO_INLINE__ 1 2025-05-07T19:44:50.7881708Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:50.7882025Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:50.7882386Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:50.7882652Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:50.7882908Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:50.7883174Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:50.7883439Z #define __cpp_unicode_characters 201411L 2025-05-07T19:44:50.7883742Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:50.7883988Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:50.7884295Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:50.7884736Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:50.7885208Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:50.7885588Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:50.7885971Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:44:50.7886296Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:50.7886574Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:50.7886867Z #define __FLT128_DIG__ 33 2025-05-07T19:44:50.7887120Z #define __INT32_C(c) c 2025-05-07T19:44:50.7887388Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:50.7887685Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:50.7888000Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:50.7888295Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:50.7888643Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:50.7888989Z #define unix 1 2025-05-07T19:44:50.7889221Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:50.7889695Z #define __cpp_rtti 199711L 2025-05-07T19:44:50.7889975Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:50.7890327Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7890648Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:50.7890990Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:50.7891340Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:50.7891619Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:50.7891928Z #define __cpp_digit_separators 201309L 2025-05-07T19:44:50.7892236Z #define __ELF__ 1 2025-05-07T19:44:50.7892492Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:50.7892784Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:50.7893087Z #define __FLT_RADIX__ 2 2025-05-07T19:44:50.7893341Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:50.7893732Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:50.7894119Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:50.7894413Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:44:50.7894700Z #define __k8 1 2025-05-07T19:44:50.7895020Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:50.7895415Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:50.7895736Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:50.7896064Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:50.7896336Z #define __LDBL_DIG__ 18 2025-05-07T19:44:50.7896603Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:50.7896873Z #define __x86_64__ 1 2025-05-07T19:44:50.7897260Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:50.7897570Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:50.7898033Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7898336Z #define __FLT64_DIG__ 15 2025-05-07T19:44:50.7898618Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7898974Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:50.7899281Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7899553Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:50.7899823Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7900128Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:50.7900484Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:50.7900889Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:50.7901176Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:50.7901504Z #define __cpp_unicode_literals 200710L 2025-05-07T19:44:50.7901919Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:50.7902258Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:50.7902568Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:50.7902845Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:50.7903166Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:50.7903443Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:50.7903688Z #define __SEG_FS 1 2025-05-07T19:44:50.7903910Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:50.7904189Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:50.7904459Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7904752Z #define __SEG_GS 1 2025-05-07T19:44:50.7905071Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:50.7905449Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:50.7905729Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:50.7906006Z #define __INT16_TYPE__ short int 2025-05-07T19:44:50.7906295Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:50.7906597Z #define __cpp_structured_bindings 201606L 2025-05-07T19:44:50.7906902Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:50.7907140Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:50.7907408Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:50.7907741Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:50.7908141Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7908464Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:44:50.7908842Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:44:50.7909150Z #define linux 1 2025-05-07T19:44:50.7909363Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7909761Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:50.7910208Z #define __EXCEPTIONS 1 2025-05-07T19:44:50.7910474Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:50.7910820Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:50.7911114Z #define __cpp_range_based_for 201603L 2025-05-07T19:44:50.7911438Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:50.7911800Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:50.7912228Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:44:50.7912592Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:50.7912961Z #define __code_model_small__ 1 2025-05-07T19:44:50.7913245Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:50.7913589Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:44:50.7913910Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:50.7914223Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:44:50.7914536Z #define __k8__ 1 2025-05-07T19:44:50.7914784Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:50.7915096Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:50.7915405Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:50.7915672Z #define __pic__ 2 2025-05-07T19:44:50.7915932Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7916275Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:50.7916554Z #define __cpp_decltype 200707L 2025-05-07T19:44:50.7916880Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7917225Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:50.7917633Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:50.7918039Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:50.7918348Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:50.7918709Z #define __cpp_inline_variables 201606L 2025-05-07T19:44:50.7919027Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:50.7919309Z #define __linux__ 1 2025-05-07T19:44:50.7919543Z #define __INT64_TYPE__ long int 2025-05-07T19:44:50.7919836Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:50.7920112Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:50.7920418Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:50.7920718Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:44:50.7921075Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:50.7921475Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7921811Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:50.7922225Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:50.7922515Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:50.7922827Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:50.7923151Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:50.7923525Z #define __SSE__ 1 2025-05-07T19:44:50.7923747Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:50.7924102Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:50.7924460Z #define __amd64__ 1 2025-05-07T19:44:50.7924673Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:50.7924940Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:50.7925200Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:50.7925470Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:50.7925733Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:50.7925999Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:50.7926263Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:50.7926534Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:50.7926872Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:50.7927345Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:50.7927718Z #define _LP64 1 2025-05-07T19:44:50.7927924Z #define __UINT8_C(c) c 2025-05-07T19:44:50.7928173Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:50.7928433Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:50.7931384Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:50.7931705Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:50.7932080Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:50.7932550Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:50.7932939Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7933241Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:50.7933557Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:50.7933879Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:44:50.7934258Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:50.7934640Z #define __STDCPP_THREADS__ 1 2025-05-07T19:44:50.7934898Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:50.7935166Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:50.7935498Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:50.7935874Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:50.7936147Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:50.7936386Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:50.7936645Z #define __FXSR__ 1 2025-05-07T19:44:50.7936936Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:50.7937403Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:50.7937805Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:50.7938127Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:50.7938387Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:44:50.7938691Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:50.7938978Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:50.7939257Z #define __cpp_alias_templates 200704L 2025-05-07T19:44:50.7939632Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:50.7940000Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:50.7940280Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:50.7940524Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:50.7940773Z #define __PIC__ 2 2025-05-07T19:44:50.7941014Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:50.7941426Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:50.7941808Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:50.7942145Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:50.7942499Z #define __cpp_constexpr 201603L 2025-05-07T19:44:50.7942834Z #define __SSE2__ 1 2025-05-07T19:44:50.7943082Z #define __cpp_deduction_guides 201703L 2025-05-07T19:44:50.7943363Z #define __INT32_TYPE__ int 2025-05-07T19:44:50.7943617Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:50.7943871Z #define __cpp_exceptions 199711L 2025-05-07T19:44:50.7944151Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:50.7944471Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:50.7944837Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:50.7945096Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:50.7945370Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:50.7945642Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7945906Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:50.7946161Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:50.7946407Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:44:50.7946705Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:50.7946988Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7947302Z #define __PIE__ 2 2025-05-07T19:44:50.7947624Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:50.7948063Z #define __cpp_template_template_args 201611L 2025-05-07T19:44:50.7948379Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:50.7948715Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:50.7949253Z #define __INT16_C(c) c 2025-05-07T19:44:50.7949563Z #define __STDC__ 1 2025-05-07T19:44:50.7949960Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:50.7950293Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:50.7950628Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:50.7950896Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:50.7951227Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:50.7951598Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:50.7951975Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:50.7952270Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:50.7952582Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:44:50.7952894Z #define __SSE_MATH__ 1 2025-05-07T19:44:50.7953149Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:50.7953473Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:44:50.7953800Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:50.7954116Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:50.7954423Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:50.7954729Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:50.7955041Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:50.7955481Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:50.7955900Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:50.7956221Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:50.7956544Z #define _GNU_SOURCE 1 2025-05-07T19:44:50.7956800Z #define __cpp_init_captures 201304L 2025-05-07T19:44:50.7957111Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:50.7957370Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:50.7957558Z 2025-05-07T19:44:50.8560717Z 2025-05-07T19:44:50.8561364Z + conda run -n build_binary c++ --version 2025-05-07T19:44:50.8562066Z 2025-05-07T19:44:52.6386939Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:44:52.6387363Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:44:52.6387842Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:44:52.6388446Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:44:52.6388805Z 2025-05-07T19:44:52.6388809Z 2025-05-07T19:44:52.7129196Z 2025-05-07T19:44:52.7129937Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:44:52.7130607Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:44:52.7130980Z 2025-05-07T19:44:54.5952228Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:54.5953675Z 2025-05-07T19:44:54.5955585Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:44:54.5957956Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:44:54.5958992Z 2025-05-07T19:44:56.4408665Z #define __cplusplus 201703L 2025-05-07T19:44:56.4408889Z 2025-05-07T19:44:56.4409058Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:44:56.4469569Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:56.4470118Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:56.4471028Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:56.4471396Z env: 2025-05-07T19:44:56.4471642Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:56.4472000Z BUILD_ENV: build_binary 2025-05-07T19:44:56.4472270Z BUILD_TARGET: default 2025-05-07T19:44:56.4472547Z BUILD_VARIANT: cuda 2025-05-07T19:44:56.4472812Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:56.4473108Z ##[endgroup] 2025-05-07T19:44:56.8366269Z ################################################################################ 2025-05-07T19:44:56.8366725Z # Install Build Tools 2025-05-07T19:44:56.8366974Z # 2025-05-07T19:44:56.8385063Z # [2025-05-07T19:44:56.837Z] + install_build_tools build_binary 2025-05-07T19:44:56.8386696Z ################################################################################ 2025-05-07T19:44:56.8387707Z 2025-05-07T19:44:56.8399278Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:56.9291736Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:56.9297363Z [INSTALL] Installing build tools ... 2025-05-07T19:44:56.9320479Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:44:57.6372993Z Channels: 2025-05-07T19:44:57.6373720Z - conda-forge 2025-05-07T19:44:57.6374250Z Platform: linux-64 2025-05-07T19:45:00.6389259Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:04.0057388Z Solving environment: \ | / done 2025-05-07T19:45:04.0570904Z 2025-05-07T19:45:04.0571771Z ## Package Plan ## 2025-05-07T19:45:04.0575916Z 2025-05-07T19:45:04.0576211Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:04.0576596Z 2025-05-07T19:45:04.0576755Z added / updated specs: 2025-05-07T19:45:04.0577070Z - auditwheel 2025-05-07T19:45:04.0577332Z - bazel 2025-05-07T19:45:04.0577583Z - cmake[version='>=3.30'] 2025-05-07T19:45:04.0577894Z - hypothesis 2025-05-07T19:45:04.0578242Z - jinja2 2025-05-07T19:45:04.0578501Z - make 2025-05-07T19:45:04.0578720Z - ncurses 2025-05-07T19:45:04.0578979Z - ninja 2025-05-07T19:45:04.0579304Z - openblas 2025-05-07T19:45:04.0579541Z - patchelf 2025-05-07T19:45:04.0579759Z - pyyaml 2025-05-07T19:45:04.0579993Z - rhash 2025-05-07T19:45:04.0580199Z - scikit-build 2025-05-07T19:45:04.0580452Z - wheel 2025-05-07T19:45:04.0580591Z 2025-05-07T19:45:04.0580595Z 2025-05-07T19:45:04.0580749Z The following packages will be downloaded: 2025-05-07T19:45:04.0580974Z 2025-05-07T19:45:04.0581099Z package | build 2025-05-07T19:45:04.0581457Z ---------------------------|----------------- 2025-05-07T19:45:04.0581845Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:04.0582321Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:04.0582769Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:04.0583227Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:04.0583661Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:04.0584065Z cairo-1.18.4 | h3394656_0 955 KB conda-forge 2025-05-07T19:45:04.0584922Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:04.0585694Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:04.0586172Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:04.0586878Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:04.0587446Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:04.0588056Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:04.0588629Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:04.0589203Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:04.0589789Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:04.0590393Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:04.0590950Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:04.0591436Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:04.0591921Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:04.0592390Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:04.0592890Z harfbuzz-11.0.0 | h76408a6_0 1.6 MB conda-forge 2025-05-07T19:45:04.0593378Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:04.0593861Z icu-75.1 | he02047a_0 11.6 MB conda-forge 2025-05-07T19:45:04.0594295Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:04.0594725Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:04.0595206Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:04.0595648Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:04.0596189Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:04.0596605Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:04.0597049Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:04.0597536Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:04.0597959Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:04.0598411Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:04.0598874Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:04.0599340Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:04.0599792Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:04.0600244Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:04.0600740Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:04.0601209Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:04.0601700Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:04.0602149Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:04.0602600Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:04.0603048Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:45:04.0603502Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:04.0603974Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:04.0604515Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:04.0605017Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:04.0605486Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:04.0606033Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:04.0606523Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:04.0606959Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:04.0607447Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:04.0607898Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:04.0608360Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:04.0608814Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:04.0609245Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:04.0609691Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:04.0610134Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:04.0610602Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:04.0611017Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:04.0611451Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:04.0611912Z markupsafe-3.0.2 | py311h2dc5d0c_1 25 KB conda-forge 2025-05-07T19:45:04.0612354Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:04.0612790Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:04.0613244Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:04.0613733Z openjdk-23.0.2 | h53dfc1b_2 181.4 MB conda-forge 2025-05-07T19:45:04.0614191Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:04.0614640Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:04.0615083Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:04.0615497Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:04.0615970Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:04.0616429Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:04.0616912Z python-3.11.11 |h9e4cc4f_2_cpython 29.2 MB conda-forge 2025-05-07T19:45:04.0617377Z pyyaml-6.0.2 | py311h2dc5d0c_2 208 KB conda-forge 2025-05-07T19:45:04.0617798Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:04.0618229Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:04.0618667Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:04.0619143Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:04.0619643Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:04.0620107Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:04.0620540Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:04.0620951Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:04.0621395Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:04.0621902Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:04.0622370Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:04.0622835Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:04.0623358Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:04.0623848Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:04.0624318Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:04.0624811Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:04.0625269Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:04.0625750Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:04.0626255Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:04.0626726Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:04.0627199Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:04.0627620Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:04.0628063Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:04.0628525Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:04.0628933Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:04.0629354Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:04.0630030Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:04.0630581Z ------------------------------------------------------------ 2025-05-07T19:45:04.0630966Z Total: 349.0 MB 2025-05-07T19:45:04.0631245Z 2025-05-07T19:45:04.0631392Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:04.0631639Z 2025-05-07T19:45:04.0631901Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:04.0632384Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:04.0632905Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:04.0633398Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:04.0633880Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:04.0634368Z cairo conda-forge/linux-64::cairo-1.18.4-h3394656_0 2025-05-07T19:45:04.0634823Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:04.0635300Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:04.0635756Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:04.0636333Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:04.0637014Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:04.0637690Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:04.0638379Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:04.0639008Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:04.0639680Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:04.0640263Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:04.0640809Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:04.0641353Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:04.0641829Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:04.0642535Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:04.0643040Z harfbuzz conda-forge/linux-64::harfbuzz-11.0.0-h76408a6_0 2025-05-07T19:45:04.0643618Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:04.0644098Z icu conda-forge/linux-64::icu-75.1-he02047a_0 2025-05-07T19:45:04.0644494Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:04.0644933Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:04.0645385Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:04.0645845Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:04.0646277Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:04.0646685Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:04.0647187Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:04.0647691Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:04.0648162Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:04.0648664Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:04.0649179Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:04.0649680Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:04.0650113Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:04.0650615Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:04.0651168Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:04.0651687Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:04.0652225Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:04.0652722Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:04.0653191Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:04.0653678Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:45:04.0654181Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:04.0654697Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:04.0655181Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:04.0655723Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:04.0656208Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:04.0656718Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:04.0657257Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:04.0657739Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:04.0658262Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:04.0658787Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:04.0659254Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:04.0659716Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:04.0660150Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:04.0660645Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:04.0661145Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:04.0661586Z libzlib conda-forge/linux-64::libzlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:04.0662108Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:04.0662585Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py311h2dc5d0c_1 2025-05-07T19:45:04.0663096Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:04.0663829Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:04.0664385Z openjdk conda-forge/linux-64::openjdk-23.0.2-h53dfc1b_2 2025-05-07T19:45:04.0664899Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:04.0665393Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:04.0665921Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:04.0666799Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:04.0667809Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:04.0668772Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:04.0669287Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py311h2dc5d0c_2 2025-05-07T19:45:04.0669889Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:04.0670411Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:04.0670960Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:04.0671527Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:04.0672102Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:04.0672679Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:04.0673338Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:04.0674246Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:04.0674802Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:04.0675340Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:04.0675921Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:04.0676485Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:04.0677081Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:04.0677654Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:04.0678203Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:04.0678827Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:04.0679392Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:04.0679942Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:04.0680518Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:04.0681219Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:04.0682011Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:04.0682442Z zstd conda-forge/linux-64::zstd-1.5.7-hb8e6e7a_2 2025-05-07T19:45:04.0682743Z 2025-05-07T19:45:04.0682875Z The following packages will be UPDATED: 2025-05-07T19:45:04.0683113Z 2025-05-07T19:45:04.0683441Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:04.0684184Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:04.0685125Z python pkgs/main::python-3.11.11-he870216_0 --> conda-forge::python-3.11.11-h9e4cc4f_2_cpython 2025-05-07T19:45:04.0685880Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:04.0686796Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:04.0687503Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:04.0688251Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:04.0688629Z 2025-05-07T19:45:04.0688882Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:04.0689273Z 2025-05-07T19:45:04.0689536Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:04.0689906Z 2025-05-07T19:45:04.0689934Z 2025-05-07T19:45:04.0689937Z 2025-05-07T19:45:04.0690130Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:04.0690550Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:04.0690840Z 2025-05-07T19:45:04.0691236Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:04.0691500Z 2025-05-07T19:45:04.0691503Z 2025-05-07T19:45:04.0691765Z python-3.11.11 | 29.2 MB | | 0%  2025-05-07T19:45:04.0692036Z 2025-05-07T19:45:04.0692039Z 2025-05-07T19:45:04.0692043Z 2025-05-07T19:45:04.0693865Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:04.0694424Z 2025-05-07T19:45:04.0694430Z 2025-05-07T19:45:04.0694435Z 2025-05-07T19:45:04.0694452Z 2025-05-07T19:45:04.0711073Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:04.0712504Z 2025-05-07T19:45:04.0712557Z 2025-05-07T19:45:04.0712578Z 2025-05-07T19:45:04.0712597Z 2025-05-07T19:45:04.0712615Z 2025-05-07T19:45:04.0713837Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:04.0715279Z 2025-05-07T19:45:04.0715302Z 2025-05-07T19:45:04.0715317Z 2025-05-07T19:45:04.0715340Z 2025-05-07T19:45:04.0715356Z 2025-05-07T19:45:04.0715376Z 2025-05-07T19:45:04.0716689Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:04.0717283Z 2025-05-07T19:45:04.0717289Z 2025-05-07T19:45:04.0717294Z 2025-05-07T19:45:04.0717299Z 2025-05-07T19:45:04.0717304Z 2025-05-07T19:45:04.0717308Z 2025-05-07T19:45:04.0717341Z 2025-05-07T19:45:04.0717761Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:04.0718343Z 2025-05-07T19:45:04.0718347Z 2025-05-07T19:45:04.0718352Z 2025-05-07T19:45:04.0718356Z 2025-05-07T19:45:04.0718361Z 2025-05-07T19:45:04.0718366Z 2025-05-07T19:45:04.0718370Z 2025-05-07T19:45:04.0718375Z 2025-05-07T19:45:04.0718868Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:04.0719385Z 2025-05-07T19:45:04.0719388Z 2025-05-07T19:45:04.0719392Z 2025-05-07T19:45:04.0719395Z 2025-05-07T19:45:04.0719400Z 2025-05-07T19:45:04.0719403Z 2025-05-07T19:45:04.0719406Z 2025-05-07T19:45:04.0719410Z 2025-05-07T19:45:04.0719414Z 2025-05-07T19:45:04.0719882Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:04.0720441Z 2025-05-07T19:45:04.0720447Z 2025-05-07T19:45:04.0720453Z 2025-05-07T19:45:04.0720459Z 2025-05-07T19:45:04.0720463Z 2025-05-07T19:45:04.0720467Z 2025-05-07T19:45:04.0720473Z 2025-05-07T19:45:04.0720476Z 2025-05-07T19:45:04.0720484Z 2025-05-07T19:45:04.0720487Z 2025-05-07T19:45:04.0721013Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:04.0721609Z 2025-05-07T19:45:04.0721616Z 2025-05-07T19:45:04.0721624Z 2025-05-07T19:45:04.0721632Z 2025-05-07T19:45:04.0721637Z 2025-05-07T19:45:04.0721670Z 2025-05-07T19:45:04.0721678Z 2025-05-07T19:45:04.0721684Z 2025-05-07T19:45:04.0721690Z 2025-05-07T19:45:04.0721698Z 2025-05-07T19:45:04.0721703Z 2025-05-07T19:45:04.0722158Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:04.0722577Z 2025-05-07T19:45:04.0722583Z 2025-05-07T19:45:04.0722780Z 2025-05-07T19:45:04.0722785Z 2025-05-07T19:45:04.0722820Z 2025-05-07T19:45:04.0722825Z 2025-05-07T19:45:04.0722831Z 2025-05-07T19:45:04.0722836Z 2025-05-07T19:45:04.0722842Z 2025-05-07T19:45:04.0722847Z 2025-05-07T19:45:04.0722851Z 2025-05-07T19:45:04.0722856Z 2025-05-07T19:45:04.0723455Z harfbuzz-11.0.0 | 1.6 MB | | 0%  2025-05-07T19:45:04.0723984Z 2025-05-07T19:45:04.0723992Z 2025-05-07T19:45:04.0724037Z 2025-05-07T19:45:04.0724045Z 2025-05-07T19:45:04.0724050Z 2025-05-07T19:45:04.0724057Z 2025-05-07T19:45:04.0724065Z 2025-05-07T19:45:04.0724070Z 2025-05-07T19:45:04.0724077Z 2025-05-07T19:45:04.0724086Z 2025-05-07T19:45:04.0724091Z 2025-05-07T19:45:04.0724098Z 2025-05-07T19:45:04.0724106Z 2025-05-07T19:45:04.0724636Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:04.0725237Z 2025-05-07T19:45:04.0725242Z 2025-05-07T19:45:04.0725249Z 2025-05-07T19:45:04.0725263Z 2025-05-07T19:45:04.0725270Z 2025-05-07T19:45:04.0725278Z 2025-05-07T19:45:04.0725284Z 2025-05-07T19:45:04.0725291Z 2025-05-07T19:45:04.0725298Z 2025-05-07T19:45:04.0725304Z 2025-05-07T19:45:04.0725311Z 2025-05-07T19:45:04.0725318Z 2025-05-07T19:45:04.0725324Z 2025-05-07T19:45:04.0725331Z 2025-05-07T19:45:04.0725887Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:04.0726538Z 2025-05-07T19:45:04.0726541Z 2025-05-07T19:45:04.0726545Z 2025-05-07T19:45:04.0726548Z 2025-05-07T19:45:04.0726552Z 2025-05-07T19:45:04.0726555Z 2025-05-07T19:45:04.0726559Z 2025-05-07T19:45:04.0726562Z 2025-05-07T19:45:04.0726565Z 2025-05-07T19:45:04.0726569Z 2025-05-07T19:45:04.0726572Z 2025-05-07T19:45:04.0726575Z 2025-05-07T19:45:04.0726580Z 2025-05-07T19:45:04.0726584Z 2025-05-07T19:45:04.0726587Z 2025-05-07T19:45:04.0726907Z krb5-1.21.3 | 1.3 MB | | 0%  2025-05-07T19:45:04.0727214Z 2025-05-07T19:45:04.0727217Z 2025-05-07T19:45:04.0727221Z 2025-05-07T19:45:04.0727225Z 2025-05-07T19:45:04.0727228Z 2025-05-07T19:45:04.0727231Z 2025-05-07T19:45:04.0727260Z 2025-05-07T19:45:04.0727263Z 2025-05-07T19:45:04.0727267Z 2025-05-07T19:45:04.0727270Z 2025-05-07T19:45:04.0727274Z 2025-05-07T19:45:04.0727277Z 2025-05-07T19:45:04.0727285Z 2025-05-07T19:45:04.0727288Z 2025-05-07T19:45:04.0727292Z 2025-05-07T19:45:04.0727296Z 2025-05-07T19:45:04.0727627Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:04.0728003Z 2025-05-07T19:45:04.0728007Z 2025-05-07T19:45:04.0728010Z 2025-05-07T19:45:04.0728013Z 2025-05-07T19:45:04.0728016Z 2025-05-07T19:45:04.0728020Z 2025-05-07T19:45:04.0728024Z 2025-05-07T19:45:04.0728027Z 2025-05-07T19:45:04.0728030Z 2025-05-07T19:45:04.0728033Z 2025-05-07T19:45:04.0728037Z 2025-05-07T19:45:04.0728040Z 2025-05-07T19:45:04.0728044Z 2025-05-07T19:45:04.0728047Z 2025-05-07T19:45:04.0728054Z 2025-05-07T19:45:04.0728057Z 2025-05-07T19:45:04.0728061Z 2025-05-07T19:45:04.0728382Z cairo-1.18.4 | 955 KB | | 0%  2025-05-07T19:45:04.0728694Z 2025-05-07T19:45:04.0728697Z 2025-05-07T19:45:04.0728701Z 2025-05-07T19:45:04.0728704Z 2025-05-07T19:45:04.0728711Z 2025-05-07T19:45:04.0728715Z 2025-05-07T19:45:04.0728718Z 2025-05-07T19:45:04.0728721Z 2025-05-07T19:45:04.0728725Z 2025-05-07T19:45:04.0728728Z 2025-05-07T19:45:04.0728732Z 2025-05-07T19:45:04.0728735Z 2025-05-07T19:45:04.0728738Z 2025-05-07T19:45:04.0728742Z 2025-05-07T19:45:04.0728745Z 2025-05-07T19:45:04.0728748Z 2025-05-07T19:45:04.0728774Z 2025-05-07T19:45:04.0728777Z 2025-05-07T19:45:04.0729075Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:04.0729397Z 2025-05-07T19:45:04.0729400Z 2025-05-07T19:45:04.0729403Z 2025-05-07T19:45:04.0729407Z 2025-05-07T19:45:04.0729513Z 2025-05-07T19:45:04.0729516Z 2025-05-07T19:45:04.0729519Z 2025-05-07T19:45:04.0729549Z 2025-05-07T19:45:04.0729552Z 2025-05-07T19:45:04.0729556Z 2025-05-07T19:45:04.0729559Z 2025-05-07T19:45:04.0729562Z 2025-05-07T19:45:04.0729566Z 2025-05-07T19:45:04.0729569Z 2025-05-07T19:45:04.0729572Z 2025-05-07T19:45:04.0729576Z 2025-05-07T19:45:04.0729635Z 2025-05-07T19:45:04.0729639Z 2025-05-07T19:45:04.0729643Z 2025-05-07T19:45:04.2541524Z ... (more hidden) ... 2025-05-07T19:45:04.2541995Z 2025-05-07T19:45:04.2542011Z 2025-05-07T19:45:04.2542015Z 2025-05-07T19:45:04.2589169Z 2025-05-07T19:45:04.3690094Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:04.3691223Z 2025-05-07T19:45:04.3691238Z 2025-05-07T19:45:04.3691249Z 2025-05-07T19:45:04.3691260Z 2025-05-07T19:45:04.4309221Z icu-75.1 | 11.6 MB | | 1%  2025-05-07T19:45:04.4310800Z 2025-05-07T19:45:04.4310874Z 2025-05-07T19:45:04.4310890Z 2025-05-07T19:45:04.4415683Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:04.4416907Z 2025-05-07T19:45:04.4422301Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:04.4422851Z 2025-05-07T19:45:04.4422857Z 2025-05-07T19:45:04.4628320Z python-3.11.11 | 29.2 MB | | 0%  2025-05-07T19:45:04.4776042Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:04.4777489Z 2025-05-07T19:45:04.4777508Z 2025-05-07T19:45:04.4777527Z 2025-05-07T19:45:04.4777545Z 2025-05-07T19:45:04.5309957Z icu-75.1 | 11.6 MB | #######6 | 76%  2025-05-07T19:45:04.5311422Z 2025-05-07T19:45:04.5311452Z 2025-05-07T19:45:04.5311463Z 2025-05-07T19:45:04.5424991Z cmake-4.0.2 | 19.4 MB | ####9 | 50%  2025-05-07T19:45:04.5425493Z 2025-05-07T19:45:04.5425522Z 2025-05-07T19:45:04.5519547Z python-3.11.11 | 29.2 MB | ###8 | 39%  2025-05-07T19:45:04.5519994Z 2025-05-07T19:45:04.5627948Z bazel-7.5.0 | 47.4 MB | 7 | 8%  2025-05-07T19:45:04.6110727Z openjdk-23.0.2 | 181.4 MB | 4 | 5% 2025-05-07T19:45:04.6111270Z 2025-05-07T19:45:04.6111302Z 2025-05-07T19:45:04.6111307Z 2025-05-07T19:45:04.6111313Z 2025-05-07T19:45:04.6359807Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:04.6360330Z 2025-05-07T19:45:04.6360338Z 2025-05-07T19:45:04.6360345Z 2025-05-07T19:45:04.6427537Z cmake-4.0.2 | 19.4 MB | #######8 | 79%  2025-05-07T19:45:04.6428018Z 2025-05-07T19:45:04.6428023Z 2025-05-07T19:45:04.6527371Z python-3.11.11 | 29.2 MB | ########2 | 82%  2025-05-07T19:45:04.6527715Z 2025-05-07T19:45:04.6629848Z bazel-7.5.0 | 47.4 MB | #5 | 16%  2025-05-07T19:45:04.7224349Z openjdk-23.0.2 | 181.4 MB | 8 | 9% 2025-05-07T19:45:04.7224871Z 2025-05-07T19:45:04.7224899Z 2025-05-07T19:45:04.7224935Z 2025-05-07T19:45:04.7224990Z 2025-05-07T19:45:04.7224997Z 2025-05-07T19:45:04.7628427Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:04.7859966Z openjdk-23.0.2 | 181.4 MB | #4 | 14% 2025-05-07T19:45:04.7860479Z 2025-05-07T19:45:04.8238540Z bazel-7.5.0 | 47.4 MB | ##1 | 21%  2025-05-07T19:45:04.8239011Z 2025-05-07T19:45:04.8239015Z 2025-05-07T19:45:04.8239018Z 2025-05-07T19:45:04.8239022Z 2025-05-07T19:45:04.8239025Z 2025-05-07T19:45:04.8290602Z libgrpc-1.71.0 | 7.6 MB | ##8 | 29%  2025-05-07T19:45:04.8291147Z 2025-05-07T19:45:04.8291156Z 2025-05-07T19:45:04.8291163Z 2025-05-07T19:45:04.8671641Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:04.8672127Z 2025-05-07T19:45:04.8672132Z 2025-05-07T19:45:04.8672135Z 2025-05-07T19:45:04.8672139Z 2025-05-07T19:45:04.8672143Z 2025-05-07T19:45:04.8672147Z 2025-05-07T19:45:04.8677194Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:04.8860169Z openjdk-23.0.2 | 181.4 MB | ##1 | 22% 2025-05-07T19:45:04.8860716Z 2025-05-07T19:45:04.9240956Z bazel-7.5.0 | 47.4 MB | ##9 | 29%  2025-05-07T19:45:04.9241253Z 2025-05-07T19:45:04.9241258Z 2025-05-07T19:45:04.9241263Z 2025-05-07T19:45:04.9241475Z 2025-05-07T19:45:04.9241480Z 2025-05-07T19:45:04.9642887Z libgrpc-1.71.0 | 7.6 MB | ########2 | 83%  2025-05-07T19:45:04.9644423Z 2025-05-07T19:45:04.9644431Z 2025-05-07T19:45:04.9675450Z python-3.11.11 | 29.2 MB | ########## | 100%  2025-05-07T19:45:04.9676509Z 2025-05-07T19:45:04.9676523Z 2025-05-07T19:45:04.9676527Z 2025-05-07T19:45:04.9676531Z 2025-05-07T19:45:04.9676534Z 2025-05-07T19:45:04.9677978Z 2025-05-07T19:45:04.9862887Z openblas-0.3.29 | 5.8 MB | #######5 | 76%  2025-05-07T19:45:04.9863428Z 2025-05-07T19:45:05.0051243Z bazel-7.5.0 | 47.4 MB | ###8 | 38%  2025-05-07T19:45:05.0051617Z 2025-05-07T19:45:05.0051718Z 2025-05-07T19:45:05.0051729Z 2025-05-07T19:45:05.0051736Z 2025-05-07T19:45:05.0051741Z 2025-05-07T19:45:05.0051745Z 2025-05-07T19:45:05.0051750Z 2025-05-07T19:45:05.0753707Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:05.0863755Z openjdk-23.0.2 | 181.4 MB | ##6 | 27% 2025-05-07T19:45:05.0864371Z 2025-05-07T19:45:05.1063284Z bazel-7.5.0 | 47.4 MB | ####5 | 46%  2025-05-07T19:45:05.1064191Z 2025-05-07T19:45:05.1064224Z 2025-05-07T19:45:05.1064232Z 2025-05-07T19:45:05.1064240Z 2025-05-07T19:45:05.1064245Z 2025-05-07T19:45:05.1064252Z 2025-05-07T19:45:05.1064260Z 2025-05-07T19:45:05.1396645Z libopenblas-0.3.29 | 5.6 MB | ########5 | 86%  2025-05-07T19:45:05.1397082Z 2025-05-07T19:45:05.1397087Z 2025-05-07T19:45:05.1397118Z 2025-05-07T19:45:05.1397121Z 2025-05-07T19:45:05.1398202Z 2025-05-07T19:45:05.1703657Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:05.1704154Z 2025-05-07T19:45:05.1704158Z 2025-05-07T19:45:05.1704162Z 2025-05-07T19:45:05.1704166Z 2025-05-07T19:45:05.1704195Z 2025-05-07T19:45:05.1704198Z 2025-05-07T19:45:05.1865300Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:05.1865707Z 2025-05-07T19:45:05.1904625Z bazel-7.5.0 | 47.4 MB | #####4 | 55%  2025-05-07T19:45:05.1905287Z 2025-05-07T19:45:05.1905291Z 2025-05-07T19:45:05.1905295Z 2025-05-07T19:45:05.1905298Z 2025-05-07T19:45:05.1905302Z 2025-05-07T19:45:05.1905305Z 2025-05-07T19:45:05.1905309Z 2025-05-07T19:45:05.1905312Z 2025-05-07T19:45:05.2158178Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:05.2187343Z openjdk-23.0.2 | 181.4 MB | ###1 | 31% 2025-05-07T19:45:05.2187636Z 2025-05-07T19:45:05.2187640Z 2025-05-07T19:45:05.2187644Z 2025-05-07T19:45:05.2187682Z 2025-05-07T19:45:05.2187686Z 2025-05-07T19:45:05.2187690Z 2025-05-07T19:45:05.2187693Z 2025-05-07T19:45:05.2187697Z 2025-05-07T19:45:05.2187700Z 2025-05-07T19:45:05.2543602Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:05.2544410Z 2025-05-07T19:45:05.2546234Z 2025-05-07T19:45:05.2546260Z 2025-05-07T19:45:05.2546267Z 2025-05-07T19:45:05.2546273Z 2025-05-07T19:45:05.2546279Z 2025-05-07T19:45:05.2546285Z 2025-05-07T19:45:05.2872142Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:05.2872489Z 2025-05-07T19:45:05.3262536Z bazel-7.5.0 | 47.4 MB | ######2 | 63%  2025-05-07T19:45:05.3262821Z 2025-05-07T19:45:05.3262935Z 2025-05-07T19:45:05.3262939Z 2025-05-07T19:45:05.3262953Z 2025-05-07T19:45:05.3262957Z 2025-05-07T19:45:05.3262975Z 2025-05-07T19:45:05.3262979Z 2025-05-07T19:45:05.3263004Z 2025-05-07T19:45:05.3263129Z 2025-05-07T19:45:05.3263137Z 2025-05-07T19:45:05.3467935Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:05.3656350Z openjdk-23.0.2 | 181.4 MB | ###5 | 35% 2025-05-07T19:45:05.3657244Z 2025-05-07T19:45:05.3657259Z 2025-05-07T19:45:05.3657271Z 2025-05-07T19:45:05.3657281Z 2025-05-07T19:45:05.3657291Z 2025-05-07T19:45:05.3657302Z 2025-05-07T19:45:05.3657718Z 2025-05-07T19:45:05.3657732Z 2025-05-07T19:45:05.3658664Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:05.3659523Z 2025-05-07T19:45:05.3659533Z 2025-05-07T19:45:05.3659544Z 2025-05-07T19:45:05.3659554Z 2025-05-07T19:45:05.3659564Z 2025-05-07T19:45:05.3659576Z 2025-05-07T19:45:05.3659586Z 2025-05-07T19:45:05.3659596Z 2025-05-07T19:45:05.3874548Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:05.3875619Z 2025-05-07T19:45:05.4002160Z bazel-7.5.0 | 47.4 MB | #######3 | 73%  2025-05-07T19:45:05.4003626Z 2025-05-07T19:45:05.4003695Z 2025-05-07T19:45:05.4003716Z 2025-05-07T19:45:05.4003738Z 2025-05-07T19:45:05.4003755Z 2025-05-07T19:45:05.4003775Z 2025-05-07T19:45:05.4003797Z 2025-05-07T19:45:05.4003814Z 2025-05-07T19:45:05.4003833Z 2025-05-07T19:45:05.4005283Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:05.4006312Z 2025-05-07T19:45:05.4006316Z 2025-05-07T19:45:05.4006319Z 2025-05-07T19:45:05.4006322Z 2025-05-07T19:45:05.4006326Z 2025-05-07T19:45:05.4006330Z 2025-05-07T19:45:05.4006333Z 2025-05-07T19:45:05.4006337Z 2025-05-07T19:45:05.4006340Z 2025-05-07T19:45:05.4214149Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:05.4214728Z 2025-05-07T19:45:05.4214733Z 2025-05-07T19:45:05.4214736Z 2025-05-07T19:45:05.4214740Z 2025-05-07T19:45:05.4214743Z 2025-05-07T19:45:05.4214747Z 2025-05-07T19:45:05.4214750Z 2025-05-07T19:45:05.4214755Z 2025-05-07T19:45:05.4214758Z 2025-05-07T19:45:05.4214762Z 2025-05-07T19:45:05.4214779Z 2025-05-07T19:45:05.4233686Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:05.4234196Z 2025-05-07T19:45:05.4234201Z 2025-05-07T19:45:05.4234205Z 2025-05-07T19:45:05.4234209Z 2025-05-07T19:45:05.4373782Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:05.4374309Z 2025-05-07T19:45:05.4374316Z 2025-05-07T19:45:05.4374322Z 2025-05-07T19:45:05.4374327Z 2025-05-07T19:45:05.4374332Z 2025-05-07T19:45:05.4374338Z 2025-05-07T19:45:05.4374344Z 2025-05-07T19:45:05.4374350Z 2025-05-07T19:45:05.4374355Z 2025-05-07T19:45:05.4374360Z 2025-05-07T19:45:05.4374363Z 2025-05-07T19:45:05.4374367Z 2025-05-07T19:45:05.4556389Z harfbuzz-11.0.0 | 1.6 MB | | 1%  2025-05-07T19:45:05.4557001Z 2025-05-07T19:45:05.4557007Z 2025-05-07T19:45:05.4557014Z 2025-05-07T19:45:05.4557022Z 2025-05-07T19:45:05.4557028Z 2025-05-07T19:45:05.4557034Z 2025-05-07T19:45:05.4557065Z 2025-05-07T19:45:05.4557099Z 2025-05-07T19:45:05.4557106Z 2025-05-07T19:45:05.4557112Z 2025-05-07T19:45:05.4557614Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:05.4557935Z 2025-05-07T19:45:05.4557938Z 2025-05-07T19:45:05.4557942Z 2025-05-07T19:45:05.4557945Z 2025-05-07T19:45:05.4557954Z 2025-05-07T19:45:05.4557958Z 2025-05-07T19:45:05.4557961Z 2025-05-07T19:45:05.4557991Z 2025-05-07T19:45:05.4557994Z 2025-05-07T19:45:05.4557997Z 2025-05-07T19:45:05.4758820Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:05.4878671Z openjdk-23.0.2 | 181.4 MB | ###8 | 39% 2025-05-07T19:45:05.4880074Z 2025-05-07T19:45:05.4993279Z bazel-7.5.0 | 47.4 MB | ########1 | 82%  2025-05-07T19:45:05.4993759Z 2025-05-07T19:45:05.4993764Z 2025-05-07T19:45:05.4993768Z 2025-05-07T19:45:05.4993772Z 2025-05-07T19:45:05.4993776Z 2025-05-07T19:45:05.4993779Z 2025-05-07T19:45:05.4993988Z 2025-05-07T19:45:05.4993992Z 2025-05-07T19:45:05.4993995Z 2025-05-07T19:45:05.4993998Z 2025-05-07T19:45:05.4994002Z 2025-05-07T19:45:05.4994006Z 2025-05-07T19:45:05.4994009Z 2025-05-07T19:45:05.5315951Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:05.5316449Z 2025-05-07T19:45:05.5316639Z 2025-05-07T19:45:05.5316645Z 2025-05-07T19:45:05.5316649Z 2025-05-07T19:45:05.5316652Z 2025-05-07T19:45:05.5316656Z 2025-05-07T19:45:05.5316659Z 2025-05-07T19:45:05.5316662Z 2025-05-07T19:45:05.5316666Z 2025-05-07T19:45:05.5316669Z 2025-05-07T19:45:05.5316673Z 2025-05-07T19:45:05.5319002Z 2025-05-07T19:45:05.5666222Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:05.5666692Z 2025-05-07T19:45:05.5666697Z 2025-05-07T19:45:05.5666700Z 2025-05-07T19:45:05.5666704Z 2025-05-07T19:45:05.5666707Z 2025-05-07T19:45:05.5666710Z 2025-05-07T19:45:05.5666714Z 2025-05-07T19:45:05.5666733Z 2025-05-07T19:45:05.5666736Z 2025-05-07T19:45:05.5666740Z 2025-05-07T19:45:05.5666766Z 2025-05-07T19:45:05.5666769Z 2025-05-07T19:45:05.5666773Z 2025-05-07T19:45:05.5719941Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:05.5721618Z 2025-05-07T19:45:05.5721632Z 2025-05-07T19:45:05.5721671Z 2025-05-07T19:45:05.5721682Z 2025-05-07T19:45:05.5721724Z 2025-05-07T19:45:05.5721734Z 2025-05-07T19:45:05.5721745Z 2025-05-07T19:45:05.5721755Z 2025-05-07T19:45:05.5721765Z 2025-05-07T19:45:05.5721776Z 2025-05-07T19:45:05.5721786Z 2025-05-07T19:45:05.5722526Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:05.5723369Z 2025-05-07T19:45:05.5723373Z 2025-05-07T19:45:05.5723376Z 2025-05-07T19:45:05.5723405Z 2025-05-07T19:45:05.5723409Z 2025-05-07T19:45:05.5723412Z 2025-05-07T19:45:05.5723415Z 2025-05-07T19:45:05.5723419Z 2025-05-07T19:45:05.5723422Z 2025-05-07T19:45:05.5723433Z 2025-05-07T19:45:05.5723437Z 2025-05-07T19:45:05.5764562Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:05.5765136Z 2025-05-07T19:45:05.5765141Z 2025-05-07T19:45:05.5765145Z 2025-05-07T19:45:05.5765148Z 2025-05-07T19:45:05.5765152Z 2025-05-07T19:45:05.5765155Z 2025-05-07T19:45:05.5765170Z 2025-05-07T19:45:05.5765174Z 2025-05-07T19:45:05.5765177Z 2025-05-07T19:45:05.5765181Z 2025-05-07T19:45:05.5765184Z 2025-05-07T19:45:05.5765187Z 2025-05-07T19:45:05.5765191Z 2025-05-07T19:45:05.5765194Z 2025-05-07T19:45:05.5886526Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:05.5887115Z 2025-05-07T19:45:05.6036859Z bazel-7.5.0 | 47.4 MB | ######### | 91%  2025-05-07T19:45:05.6077405Z openjdk-23.0.2 | 181.4 MB | ####2 | 42% 2025-05-07T19:45:05.6078842Z 2025-05-07T19:45:05.6078862Z 2025-05-07T19:45:05.6078928Z 2025-05-07T19:45:05.6078947Z 2025-05-07T19:45:05.6079014Z 2025-05-07T19:45:05.6079029Z 2025-05-07T19:45:05.6079050Z 2025-05-07T19:45:05.6079072Z 2025-05-07T19:45:05.6079087Z 2025-05-07T19:45:05.6079108Z 2025-05-07T19:45:05.6079160Z 2025-05-07T19:45:05.6079181Z 2025-05-07T19:45:05.6079202Z 2025-05-07T19:45:05.6079225Z 2025-05-07T19:45:05.6079239Z 2025-05-07T19:45:05.6256503Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:05.6257067Z 2025-05-07T19:45:05.6257072Z 2025-05-07T19:45:05.6257103Z 2025-05-07T19:45:05.6257107Z 2025-05-07T19:45:05.6257110Z 2025-05-07T19:45:05.6257114Z 2025-05-07T19:45:05.6257117Z 2025-05-07T19:45:05.6257120Z 2025-05-07T19:45:05.6257124Z 2025-05-07T19:45:05.6257127Z 2025-05-07T19:45:05.6257131Z 2025-05-07T19:45:05.6257134Z 2025-05-07T19:45:05.6257137Z 2025-05-07T19:45:05.6257140Z 2025-05-07T19:45:05.6257144Z 2025-05-07T19:45:05.6257147Z 2025-05-07T19:45:05.6675438Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:05.6676035Z 2025-05-07T19:45:05.6676039Z 2025-05-07T19:45:05.6676043Z 2025-05-07T19:45:05.6676047Z 2025-05-07T19:45:05.6676050Z 2025-05-07T19:45:05.6676054Z 2025-05-07T19:45:05.6676057Z 2025-05-07T19:45:05.6676061Z 2025-05-07T19:45:05.6676064Z 2025-05-07T19:45:05.6676068Z 2025-05-07T19:45:05.6676158Z 2025-05-07T19:45:05.6676162Z 2025-05-07T19:45:05.6676166Z 2025-05-07T19:45:05.6676512Z 2025-05-07T19:45:05.6709275Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:05.6709877Z 2025-05-07T19:45:05.6709882Z 2025-05-07T19:45:05.6709886Z 2025-05-07T19:45:05.6709890Z 2025-05-07T19:45:05.6709893Z 2025-05-07T19:45:05.6709896Z 2025-05-07T19:45:05.6709900Z 2025-05-07T19:45:05.6709903Z 2025-05-07T19:45:05.6709907Z 2025-05-07T19:45:05.6709910Z 2025-05-07T19:45:05.6709914Z 2025-05-07T19:45:05.6709917Z 2025-05-07T19:45:05.6709921Z 2025-05-07T19:45:05.6709924Z 2025-05-07T19:45:05.6709940Z 2025-05-07T19:45:05.6815655Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.6816095Z 2025-05-07T19:45:05.6816099Z 2025-05-07T19:45:05.6816103Z 2025-05-07T19:45:05.6816106Z 2025-05-07T19:45:05.6816110Z 2025-05-07T19:45:05.6816113Z 2025-05-07T19:45:05.6816117Z 2025-05-07T19:45:05.6816132Z 2025-05-07T19:45:05.6816156Z 2025-05-07T19:45:05.6816160Z 2025-05-07T19:45:05.6816163Z 2025-05-07T19:45:05.6816167Z 2025-05-07T19:45:05.6816170Z 2025-05-07T19:45:05.6816174Z 2025-05-07T19:45:05.6816177Z 2025-05-07T19:45:05.6816341Z 2025-05-07T19:45:05.6893728Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.6894706Z 2025-05-07T19:45:05.7071729Z bazel-7.5.0 | 47.4 MB | #########9 | 100%  2025-05-07T19:45:05.7072165Z 2025-05-07T19:45:05.7072169Z 2025-05-07T19:45:05.7072188Z 2025-05-07T19:45:05.7072191Z 2025-05-07T19:45:05.7072195Z 2025-05-07T19:45:05.7072211Z 2025-05-07T19:45:05.7072237Z 2025-05-07T19:45:05.7072240Z 2025-05-07T19:45:05.7072244Z 2025-05-07T19:45:05.7072247Z 2025-05-07T19:45:05.7072251Z 2025-05-07T19:45:05.7072254Z 2025-05-07T19:45:05.7072258Z 2025-05-07T19:45:05.7072261Z 2025-05-07T19:45:05.7072265Z 2025-05-07T19:45:05.7072268Z 2025-05-07T19:45:05.7072271Z 2025-05-07T19:45:05.7119769Z cairo-1.18.4 | 955 KB | 1 | 2%  2025-05-07T19:45:05.7148996Z openjdk-23.0.2 | 181.4 MB | ####5 | 45% 2025-05-07T19:45:05.7149677Z 2025-05-07T19:45:05.7149684Z 2025-05-07T19:45:05.7149689Z 2025-05-07T19:45:05.7149694Z 2025-05-07T19:45:05.7149699Z 2025-05-07T19:45:05.7149729Z 2025-05-07T19:45:05.7149735Z 2025-05-07T19:45:05.7149740Z 2025-05-07T19:45:05.7149745Z 2025-05-07T19:45:05.7149750Z 2025-05-07T19:45:05.7149755Z 2025-05-07T19:45:05.7149761Z 2025-05-07T19:45:05.7149767Z 2025-05-07T19:45:05.7149773Z 2025-05-07T19:45:05.7149778Z 2025-05-07T19:45:05.7149799Z 2025-05-07T19:45:05.7149804Z 2025-05-07T19:45:05.7149808Z 2025-05-07T19:45:05.7436848Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:05.7437345Z 2025-05-07T19:45:05.7437350Z 2025-05-07T19:45:05.7437353Z 2025-05-07T19:45:05.7437357Z 2025-05-07T19:45:05.7437374Z 2025-05-07T19:45:05.7437377Z 2025-05-07T19:45:05.7437381Z 2025-05-07T19:45:05.7437384Z 2025-05-07T19:45:05.7437387Z 2025-05-07T19:45:05.7437391Z 2025-05-07T19:45:05.7437394Z 2025-05-07T19:45:05.7437397Z 2025-05-07T19:45:05.7437401Z 2025-05-07T19:45:05.7437404Z 2025-05-07T19:45:05.7437408Z 2025-05-07T19:45:05.7437411Z 2025-05-07T19:45:05.7437414Z 2025-05-07T19:45:05.7478053Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:05.7478452Z 2025-05-07T19:45:05.7478457Z 2025-05-07T19:45:05.7478461Z 2025-05-07T19:45:05.7478464Z 2025-05-07T19:45:05.7478468Z 2025-05-07T19:45:05.7478684Z 2025-05-07T19:45:05.7478688Z 2025-05-07T19:45:05.7478692Z 2025-05-07T19:45:05.7478718Z 2025-05-07T19:45:05.7478722Z 2025-05-07T19:45:05.7478725Z 2025-05-07T19:45:05.7478728Z 2025-05-07T19:45:05.7478732Z 2025-05-07T19:45:05.7478735Z 2025-05-07T19:45:05.7478738Z 2025-05-07T19:45:05.7478742Z 2025-05-07T19:45:05.7478745Z 2025-05-07T19:45:05.7478883Z 2025-05-07T19:45:05.7520543Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:05.7521893Z 2025-05-07T19:45:05.7521907Z 2025-05-07T19:45:05.7521918Z 2025-05-07T19:45:05.7521929Z 2025-05-07T19:45:05.7521939Z 2025-05-07T19:45:05.7521949Z 2025-05-07T19:45:05.7521960Z 2025-05-07T19:45:05.7521970Z 2025-05-07T19:45:05.7521980Z 2025-05-07T19:45:05.7521991Z 2025-05-07T19:45:05.7522001Z 2025-05-07T19:45:05.7522012Z 2025-05-07T19:45:05.7522022Z 2025-05-07T19:45:05.7522032Z 2025-05-07T19:45:05.7522042Z 2025-05-07T19:45:05.7522053Z 2025-05-07T19:45:05.7522091Z 2025-05-07T19:45:05.7522101Z 2025-05-07T19:45:05.7522112Z 2025-05-07T19:45:05.7786128Z ... (more hidden) ... 2025-05-07T19:45:05.7786724Z 2025-05-07T19:45:05.7786729Z 2025-05-07T19:45:05.7786733Z 2025-05-07T19:45:05.7786736Z 2025-05-07T19:45:05.7786739Z 2025-05-07T19:45:05.7786743Z 2025-05-07T19:45:05.7786758Z 2025-05-07T19:45:05.7786782Z 2025-05-07T19:45:05.7786786Z 2025-05-07T19:45:05.7786789Z 2025-05-07T19:45:05.7786793Z 2025-05-07T19:45:05.7786796Z 2025-05-07T19:45:05.7786800Z 2025-05-07T19:45:05.7786803Z 2025-05-07T19:45:05.7786806Z 2025-05-07T19:45:05.7786810Z 2025-05-07T19:45:05.7786813Z 2025-05-07T19:45:05.7786817Z 2025-05-07T19:45:05.7786820Z 2025-05-07T19:45:05.8119425Z ... (more hidden) ... 2025-05-07T19:45:05.9119898Z openjdk-23.0.2 | 181.4 MB | ####8 | 49% 2025-05-07T19:45:06.0120581Z openjdk-23.0.2 | 181.4 MB | #####3 | 53% 2025-05-07T19:45:06.1121870Z openjdk-23.0.2 | 181.4 MB | #####8 | 58% 2025-05-07T19:45:06.2121886Z openjdk-23.0.2 | 181.4 MB | ######3 | 63% 2025-05-07T19:45:06.2949225Z openjdk-23.0.2 | 181.4 MB | ######8 | 69% 2025-05-07T19:45:06.2949972Z 2025-05-07T19:45:06.2949979Z 2025-05-07T19:45:06.2949986Z 2025-05-07T19:45:06.2950015Z 2025-05-07T19:45:06.2950021Z 2025-05-07T19:45:06.3831477Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:06.4059908Z openjdk-23.0.2 | 181.4 MB | #######3 | 74% 2025-05-07T19:45:06.4060426Z 2025-05-07T19:45:06.4919068Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:06.5920952Z openjdk-23.0.2 | 181.4 MB | #######8 | 78% 2025-05-07T19:45:06.6920996Z openjdk-23.0.2 | 181.4 MB | ########4 | 84% 2025-05-07T19:45:06.7608609Z openjdk-23.0.2 | 181.4 MB | ########9 | 89% 2025-05-07T19:45:06.7609416Z 2025-05-07T19:45:06.7609431Z 2025-05-07T19:45:06.7609477Z 2025-05-07T19:45:06.7609487Z 2025-05-07T19:45:06.7609498Z 2025-05-07T19:45:06.7609508Z 2025-05-07T19:45:06.8231829Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:06.9819604Z openjdk-23.0.2 | 181.4 MB | #########4 | 94% 2025-05-07T19:45:06.9871892Z openjdk-23.0.2 | 181.4 MB | #########8 | 99% 2025-05-07T19:45:06.9872445Z 2025-05-07T19:45:06.9872452Z 2025-05-07T19:45:06.9872456Z 2025-05-07T19:45:06.9872460Z 2025-05-07T19:45:06.9872463Z 2025-05-07T19:45:06.9872467Z 2025-05-07T19:45:06.9872470Z 2025-05-07T19:45:07.1058035Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:07.1059513Z 2025-05-07T19:45:07.1059556Z 2025-05-07T19:45:07.1059568Z 2025-05-07T19:45:07.1059579Z 2025-05-07T19:45:07.1059590Z 2025-05-07T19:45:07.1059601Z 2025-05-07T19:45:07.1059611Z 2025-05-07T19:45:07.1059621Z 2025-05-07T19:45:07.3895455Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:07.3896472Z 2025-05-07T19:45:07.3896477Z 2025-05-07T19:45:07.3896481Z 2025-05-07T19:45:07.3896485Z 2025-05-07T19:45:07.3896489Z 2025-05-07T19:45:07.3896492Z 2025-05-07T19:45:07.3896496Z 2025-05-07T19:45:07.3896499Z 2025-05-07T19:45:07.3896503Z 2025-05-07T19:45:07.6253681Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:07.6254516Z 2025-05-07T19:45:07.6254521Z 2025-05-07T19:45:07.7391227Z python-3.11.11 | 29.2 MB | ########## | 100%  2025-05-07T19:45:07.7392595Z 2025-05-07T19:45:07.7392608Z 2025-05-07T19:45:07.7392619Z 2025-05-07T19:45:07.7392629Z 2025-05-07T19:45:07.7392640Z 2025-05-07T19:45:07.7392650Z 2025-05-07T19:45:07.7392660Z 2025-05-07T19:45:07.7392670Z 2025-05-07T19:45:07.7392680Z 2025-05-07T19:45:07.7392691Z 2025-05-07T19:45:07.7716106Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:07.7716625Z 2025-05-07T19:45:07.7716630Z 2025-05-07T19:45:07.7716653Z 2025-05-07T19:45:07.7716657Z 2025-05-07T19:45:07.7716660Z 2025-05-07T19:45:07.7716664Z 2025-05-07T19:45:07.7716667Z 2025-05-07T19:45:07.7716671Z 2025-05-07T19:45:07.7716675Z 2025-05-07T19:45:07.7716678Z 2025-05-07T19:45:07.7716682Z 2025-05-07T19:45:07.7716685Z 2025-05-07T19:45:07.7717446Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:07.7718072Z 2025-05-07T19:45:07.7718080Z 2025-05-07T19:45:07.7718088Z 2025-05-07T19:45:07.7718093Z 2025-05-07T19:45:07.7718101Z 2025-05-07T19:45:07.7718109Z 2025-05-07T19:45:07.7718114Z 2025-05-07T19:45:07.7718122Z 2025-05-07T19:45:07.7718128Z 2025-05-07T19:45:07.7718135Z 2025-05-07T19:45:07.7718143Z 2025-05-07T19:45:07.7718148Z 2025-05-07T19:45:07.7931072Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:07.7931941Z 2025-05-07T19:45:07.7931946Z 2025-05-07T19:45:07.7931950Z 2025-05-07T19:45:07.7931953Z 2025-05-07T19:45:07.7931975Z 2025-05-07T19:45:07.7931979Z 2025-05-07T19:45:07.7931982Z 2025-05-07T19:45:07.7931986Z 2025-05-07T19:45:07.7932021Z 2025-05-07T19:45:07.7932025Z 2025-05-07T19:45:07.7932028Z 2025-05-07T19:45:07.7932031Z 2025-05-07T19:45:07.7932035Z 2025-05-07T19:45:07.7932485Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:07.7932934Z 2025-05-07T19:45:07.7932938Z 2025-05-07T19:45:07.7932941Z 2025-05-07T19:45:07.7932945Z 2025-05-07T19:45:07.7932948Z 2025-05-07T19:45:07.7932976Z 2025-05-07T19:45:07.7932979Z 2025-05-07T19:45:07.7932982Z 2025-05-07T19:45:07.7932985Z 2025-05-07T19:45:07.7932988Z 2025-05-07T19:45:07.7932992Z 2025-05-07T19:45:07.7932995Z 2025-05-07T19:45:07.7933009Z 2025-05-07T19:45:07.8920381Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:07.8922394Z 2025-05-07T19:45:07.8922411Z 2025-05-07T19:45:07.8922428Z 2025-05-07T19:45:07.8922444Z 2025-05-07T19:45:07.8922499Z 2025-05-07T19:45:07.8922515Z 2025-05-07T19:45:07.8922572Z 2025-05-07T19:45:07.8922590Z 2025-05-07T19:45:07.8922606Z 2025-05-07T19:45:07.8922626Z 2025-05-07T19:45:07.8922642Z 2025-05-07T19:45:07.8922654Z 2025-05-07T19:45:07.8922664Z 2025-05-07T19:45:07.8922676Z 2025-05-07T19:45:07.8923653Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:07.8924645Z 2025-05-07T19:45:07.8924691Z 2025-05-07T19:45:07.8924701Z 2025-05-07T19:45:07.8924712Z 2025-05-07T19:45:07.8924722Z 2025-05-07T19:45:07.8924733Z 2025-05-07T19:45:07.8924744Z 2025-05-07T19:45:07.8924754Z 2025-05-07T19:45:07.8924763Z 2025-05-07T19:45:07.8924774Z 2025-05-07T19:45:07.8924784Z 2025-05-07T19:45:07.8924794Z 2025-05-07T19:45:07.8924804Z 2025-05-07T19:45:07.8924814Z 2025-05-07T19:45:08.0516968Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:08.0519004Z 2025-05-07T19:45:08.0519019Z 2025-05-07T19:45:08.0519448Z 2025-05-07T19:45:08.0519460Z 2025-05-07T19:45:08.0519471Z 2025-05-07T19:45:08.0519481Z 2025-05-07T19:45:08.0519492Z 2025-05-07T19:45:08.0519502Z 2025-05-07T19:45:08.0519512Z 2025-05-07T19:45:08.0519522Z 2025-05-07T19:45:08.0519533Z 2025-05-07T19:45:08.0519543Z 2025-05-07T19:45:08.0519553Z 2025-05-07T19:45:08.0519564Z 2025-05-07T19:45:08.0519790Z 2025-05-07T19:45:08.0520719Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:08.0521608Z 2025-05-07T19:45:08.0521619Z 2025-05-07T19:45:08.0521630Z 2025-05-07T19:45:08.0521641Z 2025-05-07T19:45:08.0521651Z 2025-05-07T19:45:08.0521661Z 2025-05-07T19:45:08.0521671Z 2025-05-07T19:45:08.0521681Z 2025-05-07T19:45:08.0521692Z 2025-05-07T19:45:08.0521734Z 2025-05-07T19:45:08.0521744Z 2025-05-07T19:45:08.0521755Z 2025-05-07T19:45:08.0521764Z 2025-05-07T19:45:08.0521775Z 2025-05-07T19:45:08.0521785Z 2025-05-07T19:45:08.1732022Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:08.1733448Z 2025-05-07T19:45:08.1733497Z 2025-05-07T19:45:08.1733509Z 2025-05-07T19:45:08.1733519Z 2025-05-07T19:45:08.1733530Z 2025-05-07T19:45:08.1733540Z 2025-05-07T19:45:08.1733551Z 2025-05-07T19:45:08.1733561Z 2025-05-07T19:45:08.1733572Z 2025-05-07T19:45:08.1733583Z 2025-05-07T19:45:08.1733610Z 2025-05-07T19:45:08.2481317Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:08.2482740Z 2025-05-07T19:45:08.2482769Z 2025-05-07T19:45:08.2482781Z 2025-05-07T19:45:08.2501508Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:08.2503041Z 2025-05-07T19:45:08.2503061Z 2025-05-07T19:45:08.2503141Z 2025-05-07T19:45:08.2503157Z 2025-05-07T19:45:08.2503168Z 2025-05-07T19:45:08.2503179Z 2025-05-07T19:45:08.2503189Z 2025-05-07T19:45:08.2503199Z 2025-05-07T19:45:08.2503209Z 2025-05-07T19:45:08.2503219Z 2025-05-07T19:45:08.2503230Z 2025-05-07T19:45:08.2503272Z 2025-05-07T19:45:08.2503283Z 2025-05-07T19:45:08.2503293Z 2025-05-07T19:45:08.2503303Z 2025-05-07T19:45:08.2503313Z 2025-05-07T19:45:08.2503324Z 2025-05-07T19:45:08.2504275Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:08.2505221Z 2025-05-07T19:45:08.2505248Z 2025-05-07T19:45:08.2505259Z 2025-05-07T19:45:08.2505270Z 2025-05-07T19:45:08.2505279Z 2025-05-07T19:45:08.2505289Z 2025-05-07T19:45:08.2505299Z 2025-05-07T19:45:08.2505310Z 2025-05-07T19:45:08.2505320Z 2025-05-07T19:45:08.2505330Z 2025-05-07T19:45:08.2505380Z 2025-05-07T19:45:08.2505390Z 2025-05-07T19:45:08.2505400Z 2025-05-07T19:45:08.2505410Z 2025-05-07T19:45:08.2505420Z 2025-05-07T19:45:08.2505430Z 2025-05-07T19:45:08.2505440Z 2025-05-07T19:45:08.2890680Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:08.2892148Z 2025-05-07T19:45:08.2892163Z 2025-05-07T19:45:08.2892208Z 2025-05-07T19:45:08.2892219Z 2025-05-07T19:45:08.2892230Z 2025-05-07T19:45:08.2892240Z 2025-05-07T19:45:08.2892251Z 2025-05-07T19:45:08.2892261Z 2025-05-07T19:45:08.2892271Z 2025-05-07T19:45:08.2892282Z 2025-05-07T19:45:08.2892292Z 2025-05-07T19:45:08.2892301Z 2025-05-07T19:45:08.2892311Z 2025-05-07T19:45:08.2892321Z 2025-05-07T19:45:08.2892344Z 2025-05-07T19:45:08.2892355Z 2025-05-07T19:45:08.2892365Z 2025-05-07T19:45:08.2892375Z 2025-05-07T19:45:08.2892386Z 2025-05-07T19:45:08.2893229Z ... (more hidden) ... 2025-05-07T19:45:08.2893950Z 2025-05-07T19:45:08.2893954Z 2025-05-07T19:45:08.2893957Z 2025-05-07T19:45:08.2893960Z 2025-05-07T19:45:08.2893963Z 2025-05-07T19:45:08.2893967Z 2025-05-07T19:45:08.2893970Z 2025-05-07T19:45:08.2893973Z 2025-05-07T19:45:08.2893976Z 2025-05-07T19:45:08.2893979Z 2025-05-07T19:45:08.2893982Z 2025-05-07T19:45:08.2893985Z 2025-05-07T19:45:08.2893989Z 2025-05-07T19:45:08.2894018Z 2025-05-07T19:45:08.2894250Z 2025-05-07T19:45:08.2894253Z 2025-05-07T19:45:08.2894256Z 2025-05-07T19:45:08.2894259Z 2025-05-07T19:45:08.2894262Z 2025-05-07T19:45:08.4411613Z ... (more hidden) ... 2025-05-07T19:45:08.4412818Z 2025-05-07T19:45:08.4412823Z 2025-05-07T19:45:08.4412827Z 2025-05-07T19:45:08.4413083Z 2025-05-07T19:45:08.4413089Z 2025-05-07T19:45:08.4413121Z 2025-05-07T19:45:08.4413125Z 2025-05-07T19:45:08.4413128Z 2025-05-07T19:45:08.4413132Z 2025-05-07T19:45:08.4413135Z 2025-05-07T19:45:08.4413139Z 2025-05-07T19:45:08.4413143Z 2025-05-07T19:45:08.4413146Z 2025-05-07T19:45:08.4413149Z 2025-05-07T19:45:08.4413153Z 2025-05-07T19:45:08.4413156Z 2025-05-07T19:45:08.4413159Z 2025-05-07T19:45:08.4413163Z 2025-05-07T19:45:08.4413516Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:08.4413869Z 2025-05-07T19:45:08.4413873Z 2025-05-07T19:45:08.4413876Z 2025-05-07T19:45:08.4413887Z 2025-05-07T19:45:08.4413891Z 2025-05-07T19:45:08.4413894Z 2025-05-07T19:45:08.4413898Z 2025-05-07T19:45:08.4413901Z 2025-05-07T19:45:08.4413904Z 2025-05-07T19:45:08.4413907Z 2025-05-07T19:45:08.4413911Z 2025-05-07T19:45:08.4413914Z 2025-05-07T19:45:08.4413917Z 2025-05-07T19:45:08.4413921Z 2025-05-07T19:45:08.4413929Z 2025-05-07T19:45:08.4413932Z 2025-05-07T19:45:08.4413936Z 2025-05-07T19:45:08.4413939Z 2025-05-07T19:45:08.4811156Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:08.4812539Z 2025-05-07T19:45:08.4812553Z 2025-05-07T19:45:08.4812564Z 2025-05-07T19:45:08.4812574Z 2025-05-07T19:45:08.4812584Z 2025-05-07T19:45:08.4812594Z 2025-05-07T19:45:08.4812604Z 2025-05-07T19:45:08.4812647Z 2025-05-07T19:45:08.4812657Z 2025-05-07T19:45:08.4812666Z 2025-05-07T19:45:08.4812677Z 2025-05-07T19:45:08.4812687Z 2025-05-07T19:45:08.4812698Z 2025-05-07T19:45:08.4812708Z 2025-05-07T19:45:08.4812749Z 2025-05-07T19:45:08.4812759Z 2025-05-07T19:45:08.4813775Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:08.4814536Z 2025-05-07T19:45:08.4814539Z 2025-05-07T19:45:08.4814543Z 2025-05-07T19:45:08.4814546Z 2025-05-07T19:45:08.4814549Z 2025-05-07T19:45:08.4814552Z 2025-05-07T19:45:08.4814561Z 2025-05-07T19:45:08.4814564Z 2025-05-07T19:45:08.4814567Z 2025-05-07T19:45:08.4814571Z 2025-05-07T19:45:08.4814574Z 2025-05-07T19:45:08.4814577Z 2025-05-07T19:45:08.4814580Z 2025-05-07T19:45:08.4814583Z 2025-05-07T19:45:08.4814587Z 2025-05-07T19:45:08.4814590Z 2025-05-07T19:45:08.7036108Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:10.2633612Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:10.2634415Z 2025-05-07T19:45:10.9201190Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:10.9204771Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:10.9205546Z 2025-05-07T19:45:10.9205561Z 2025-05-07T19:45:10.9205611Z 2025-05-07T19:45:10.9205622Z 2025-05-07T19:45:10.9205632Z 2025-05-07T19:45:10.9205642Z 2025-05-07T19:45:10.9205653Z 2025-05-07T19:45:10.9205663Z 2025-05-07T19:45:10.9205674Z 2025-05-07T19:45:10.9205684Z 2025-05-07T19:45:10.9205709Z 2025-05-07T19:45:10.9205720Z 2025-05-07T19:45:10.9205765Z 2025-05-07T19:45:10.9205775Z 2025-05-07T19:45:10.9205786Z 2025-05-07T19:45:10.9205796Z 2025-05-07T19:45:10.9205806Z 2025-05-07T19:45:10.9205816Z 2025-05-07T19:45:10.9205827Z 2025-05-07T19:45:10.9206163Z 2025-05-07T19:45:10.9206522Z  2025-05-07T19:45:10.9206899Z 2025-05-07T19:45:10.9207118Z 2025-05-07T19:45:10.9207291Z  2025-05-07T19:45:10.9207543Z 2025-05-07T19:45:10.9207547Z 2025-05-07T19:45:10.9207959Z  2025-05-07T19:45:10.9208187Z 2025-05-07T19:45:10.9208190Z 2025-05-07T19:45:10.9208194Z 2025-05-07T19:45:10.9208474Z  2025-05-07T19:45:10.9208703Z 2025-05-07T19:45:10.9208707Z 2025-05-07T19:45:10.9208710Z 2025-05-07T19:45:10.9208877Z 2025-05-07T19:45:10.9209067Z  2025-05-07T19:45:10.9209335Z 2025-05-07T19:45:10.9209338Z 2025-05-07T19:45:10.9209342Z 2025-05-07T19:45:10.9209346Z 2025-05-07T19:45:10.9209349Z 2025-05-07T19:45:10.9209546Z  2025-05-07T19:45:10.9209786Z 2025-05-07T19:45:10.9209790Z 2025-05-07T19:45:10.9209823Z 2025-05-07T19:45:10.9209827Z 2025-05-07T19:45:10.9209830Z 2025-05-07T19:45:10.9209834Z 2025-05-07T19:45:10.9210031Z  2025-05-07T19:45:10.9210279Z 2025-05-07T19:45:10.9210283Z 2025-05-07T19:45:10.9210287Z 2025-05-07T19:45:10.9210291Z 2025-05-07T19:45:10.9210294Z 2025-05-07T19:45:10.9210298Z 2025-05-07T19:45:10.9210333Z 2025-05-07T19:45:10.9210531Z  2025-05-07T19:45:10.9210778Z 2025-05-07T19:45:10.9210782Z 2025-05-07T19:45:10.9210789Z 2025-05-07T19:45:10.9210793Z 2025-05-07T19:45:10.9210796Z 2025-05-07T19:45:10.9210800Z 2025-05-07T19:45:10.9210803Z 2025-05-07T19:45:10.9210807Z 2025-05-07T19:45:10.9211041Z  2025-05-07T19:45:10.9211298Z 2025-05-07T19:45:10.9211302Z 2025-05-07T19:45:10.9211305Z 2025-05-07T19:45:10.9211309Z 2025-05-07T19:45:10.9211312Z 2025-05-07T19:45:10.9211316Z 2025-05-07T19:45:10.9211319Z 2025-05-07T19:45:10.9211322Z 2025-05-07T19:45:10.9211326Z 2025-05-07T19:45:10.9211565Z  2025-05-07T19:45:10.9211824Z 2025-05-07T19:45:10.9211827Z 2025-05-07T19:45:10.9211831Z 2025-05-07T19:45:10.9211835Z 2025-05-07T19:45:10.9211839Z 2025-05-07T19:45:10.9211842Z 2025-05-07T19:45:10.9211846Z 2025-05-07T19:45:10.9211919Z 2025-05-07T19:45:10.9211922Z 2025-05-07T19:45:10.9211925Z 2025-05-07T19:45:10.9212140Z  2025-05-07T19:45:10.9212422Z 2025-05-07T19:45:10.9212425Z 2025-05-07T19:45:10.9212429Z 2025-05-07T19:45:10.9212433Z 2025-05-07T19:45:10.9212436Z 2025-05-07T19:45:10.9212440Z 2025-05-07T19:45:10.9212443Z 2025-05-07T19:45:10.9212447Z 2025-05-07T19:45:10.9212451Z 2025-05-07T19:45:10.9212455Z 2025-05-07T19:45:10.9212458Z 2025-05-07T19:45:10.9212671Z  2025-05-07T19:45:10.9212957Z 2025-05-07T19:45:10.9212960Z 2025-05-07T19:45:10.9212964Z 2025-05-07T19:45:10.9212967Z 2025-05-07T19:45:10.9212971Z 2025-05-07T19:45:10.9212979Z 2025-05-07T19:45:10.9212983Z 2025-05-07T19:45:10.9212987Z 2025-05-07T19:45:10.9212990Z 2025-05-07T19:45:10.9212994Z 2025-05-07T19:45:10.9212998Z 2025-05-07T19:45:10.9213001Z 2025-05-07T19:45:10.9213218Z  2025-05-07T19:45:10.9213505Z 2025-05-07T19:45:10.9213513Z 2025-05-07T19:45:10.9213517Z 2025-05-07T19:45:10.9213520Z 2025-05-07T19:45:10.9213524Z 2025-05-07T19:45:10.9213527Z 2025-05-07T19:45:10.9213531Z 2025-05-07T19:45:10.9213535Z 2025-05-07T19:45:10.9213538Z 2025-05-07T19:45:10.9213542Z 2025-05-07T19:45:10.9213545Z 2025-05-07T19:45:10.9213549Z 2025-05-07T19:45:10.9213552Z 2025-05-07T19:45:10.9213772Z  2025-05-07T19:45:10.9214060Z 2025-05-07T19:45:10.9214064Z 2025-05-07T19:45:10.9214068Z 2025-05-07T19:45:10.9214072Z 2025-05-07T19:45:10.9214075Z 2025-05-07T19:45:10.9214079Z 2025-05-07T19:45:10.9214156Z 2025-05-07T19:45:10.9214160Z 2025-05-07T19:45:10.9214163Z 2025-05-07T19:45:10.9214167Z 2025-05-07T19:45:10.9214170Z 2025-05-07T19:45:10.9214174Z 2025-05-07T19:45:10.9214177Z 2025-05-07T19:45:10.9214181Z 2025-05-07T19:45:10.9214435Z  2025-05-07T19:45:10.9214822Z 2025-05-07T19:45:10.9214826Z 2025-05-07T19:45:10.9214830Z 2025-05-07T19:45:10.9214833Z 2025-05-07T19:45:10.9214837Z 2025-05-07T19:45:10.9214840Z 2025-05-07T19:45:10.9214844Z 2025-05-07T19:45:10.9214847Z 2025-05-07T19:45:10.9214851Z 2025-05-07T19:45:10.9214855Z 2025-05-07T19:45:10.9214859Z 2025-05-07T19:45:10.9214862Z 2025-05-07T19:45:10.9214866Z 2025-05-07T19:45:10.9214895Z 2025-05-07T19:45:10.9214899Z 2025-05-07T19:45:10.9215130Z  2025-05-07T19:45:10.9215395Z 2025-05-07T19:45:10.9215398Z 2025-05-07T19:45:10.9215401Z 2025-05-07T19:45:10.9215410Z 2025-05-07T19:45:10.9215413Z 2025-05-07T19:45:10.9215417Z 2025-05-07T19:45:10.9215421Z 2025-05-07T19:45:10.9215456Z 2025-05-07T19:45:10.9215459Z 2025-05-07T19:45:10.9215463Z 2025-05-07T19:45:10.9215466Z 2025-05-07T19:45:10.9215470Z 2025-05-07T19:45:10.9215473Z 2025-05-07T19:45:10.9215477Z 2025-05-07T19:45:10.9215481Z 2025-05-07T19:45:10.9215487Z 2025-05-07T19:45:10.9215725Z  2025-05-07T19:45:10.9215993Z 2025-05-07T19:45:10.9216028Z 2025-05-07T19:45:10.9216032Z 2025-05-07T19:45:10.9216035Z 2025-05-07T19:45:10.9216038Z 2025-05-07T19:45:10.9216042Z 2025-05-07T19:45:10.9216045Z 2025-05-07T19:45:10.9216049Z 2025-05-07T19:45:10.9216053Z 2025-05-07T19:45:10.9216056Z 2025-05-07T19:45:10.9216060Z 2025-05-07T19:45:10.9216063Z 2025-05-07T19:45:10.9216067Z 2025-05-07T19:45:10.9216070Z 2025-05-07T19:45:10.9216074Z 2025-05-07T19:45:10.9216078Z 2025-05-07T19:45:10.9216081Z 2025-05-07T19:45:10.9216323Z  2025-05-07T19:45:10.9216624Z 2025-05-07T19:45:10.9216627Z 2025-05-07T19:45:10.9216631Z 2025-05-07T19:45:10.9216634Z 2025-05-07T19:45:10.9216638Z 2025-05-07T19:45:10.9216641Z 2025-05-07T19:45:10.9216645Z 2025-05-07T19:45:10.9216652Z 2025-05-07T19:45:10.9216656Z 2025-05-07T19:45:10.9216659Z 2025-05-07T19:45:10.9216662Z 2025-05-07T19:45:10.9216666Z 2025-05-07T19:45:10.9216669Z 2025-05-07T19:45:10.9216673Z 2025-05-07T19:45:10.9216677Z 2025-05-07T19:45:10.9216681Z 2025-05-07T19:45:10.9216684Z 2025-05-07T19:45:10.9216719Z 2025-05-07T19:45:10.9216964Z  2025-05-07T19:45:10.9217242Z 2025-05-07T19:45:10.9217246Z 2025-05-07T19:45:10.9217364Z  2025-05-07T19:45:10.9217509Z 2025-05-07T19:45:10.9217513Z 2025-05-07T19:45:10.9217701Z  2025-05-07T19:45:10.9217827Z 2025-05-07T19:45:10.9217835Z 2025-05-07T19:45:10.9217838Z 2025-05-07T19:45:10.9217950Z  2025-05-07T19:45:10.9218102Z 2025-05-07T19:45:10.9218106Z 2025-05-07T19:45:10.9218110Z 2025-05-07T19:45:10.9218113Z 2025-05-07T19:45:10.9218242Z  2025-05-07T19:45:10.9218370Z 2025-05-07T19:45:10.9218374Z 2025-05-07T19:45:10.9218377Z 2025-05-07T19:45:10.9218385Z 2025-05-07T19:45:10.9218389Z 2025-05-07T19:45:10.9218529Z  2025-05-07T19:45:10.9218666Z 2025-05-07T19:45:10.9218670Z 2025-05-07T19:45:10.9218674Z 2025-05-07T19:45:10.9218677Z 2025-05-07T19:45:10.9218681Z 2025-05-07T19:45:10.9218684Z 2025-05-07T19:45:10.9218832Z  2025-05-07T19:45:10.9218973Z 2025-05-07T19:45:10.9218977Z 2025-05-07T19:45:10.9218980Z 2025-05-07T19:45:10.9218984Z 2025-05-07T19:45:10.9218987Z 2025-05-07T19:45:10.9218991Z 2025-05-07T19:45:10.9218994Z 2025-05-07T19:45:10.9219119Z  2025-05-07T19:45:10.9219304Z 2025-05-07T19:45:10.9219307Z 2025-05-07T19:45:10.9219371Z 2025-05-07T19:45:10.9219374Z 2025-05-07T19:45:10.9219378Z 2025-05-07T19:45:10.9219382Z 2025-05-07T19:45:10.9219385Z 2025-05-07T19:45:10.9219389Z 2025-05-07T19:45:10.9219519Z  2025-05-07T19:45:10.9219709Z 2025-05-07T19:45:10.9219713Z 2025-05-07T19:45:10.9219716Z 2025-05-07T19:45:10.9219720Z 2025-05-07T19:45:10.9220896Z 2025-05-07T19:45:10.9220900Z 2025-05-07T19:45:10.9220904Z 2025-05-07T19:45:10.9220907Z 2025-05-07T19:45:10.9220911Z 2025-05-07T19:45:10.9221063Z  2025-05-07T19:45:10.9221238Z 2025-05-07T19:45:10.9221242Z 2025-05-07T19:45:10.9221272Z 2025-05-07T19:45:10.9221275Z 2025-05-07T19:45:10.9221278Z 2025-05-07T19:45:10.9221282Z 2025-05-07T19:45:10.9221285Z 2025-05-07T19:45:10.9221289Z 2025-05-07T19:45:10.9221292Z 2025-05-07T19:45:10.9221295Z 2025-05-07T19:45:10.9221434Z  2025-05-07T19:45:10.9221613Z 2025-05-07T19:45:10.9221616Z 2025-05-07T19:45:10.9221620Z 2025-05-07T19:45:10.9221650Z 2025-05-07T19:45:10.9221658Z 2025-05-07T19:45:10.9221662Z 2025-05-07T19:45:10.9221665Z 2025-05-07T19:45:10.9221668Z 2025-05-07T19:45:10.9221672Z 2025-05-07T19:45:10.9221675Z 2025-05-07T19:45:10.9221679Z 2025-05-07T19:45:10.9221824Z  2025-05-07T19:45:10.9222017Z 2025-05-07T19:45:10.9222021Z 2025-05-07T19:45:10.9222028Z 2025-05-07T19:45:10.9222058Z 2025-05-07T19:45:10.9222061Z 2025-05-07T19:45:10.9222065Z 2025-05-07T19:45:10.9222068Z 2025-05-07T19:45:10.9222071Z 2025-05-07T19:45:10.9222075Z 2025-05-07T19:45:10.9222078Z 2025-05-07T19:45:10.9222082Z 2025-05-07T19:45:10.9222085Z 2025-05-07T19:45:10.9222228Z  2025-05-07T19:45:10.9222429Z 2025-05-07T19:45:10.9222456Z 2025-05-07T19:45:10.9222460Z 2025-05-07T19:45:10.9222463Z 2025-05-07T19:45:10.9222467Z 2025-05-07T19:45:10.9222470Z 2025-05-07T19:45:10.9222473Z 2025-05-07T19:45:10.9222477Z 2025-05-07T19:45:10.9222480Z 2025-05-07T19:45:10.9222484Z 2025-05-07T19:45:10.9222487Z 2025-05-07T19:45:10.9222494Z 2025-05-07T19:45:10.9222498Z 2025-05-07T19:45:10.9222645Z  2025-05-07T19:45:10.9222881Z 2025-05-07T19:45:10.9222884Z 2025-05-07T19:45:10.9222888Z 2025-05-07T19:45:10.9222891Z 2025-05-07T19:45:10.9222895Z 2025-05-07T19:45:10.9222899Z 2025-05-07T19:45:10.9222902Z 2025-05-07T19:45:10.9222909Z 2025-05-07T19:45:10.9222912Z 2025-05-07T19:45:10.9222916Z 2025-05-07T19:45:10.9222919Z 2025-05-07T19:45:10.9222922Z 2025-05-07T19:45:10.9222926Z 2025-05-07T19:45:10.9222929Z 2025-05-07T19:45:10.9223087Z  2025-05-07T19:45:10.9223334Z 2025-05-07T19:45:10.9223337Z 2025-05-07T19:45:10.9223341Z 2025-05-07T19:45:10.9223344Z 2025-05-07T19:45:10.9223347Z 2025-05-07T19:45:10.9223351Z 2025-05-07T19:45:10.9223354Z 2025-05-07T19:45:10.9223358Z 2025-05-07T19:45:10.9223361Z 2025-05-07T19:45:10.9223365Z 2025-05-07T19:45:10.9223368Z 2025-05-07T19:45:10.9223372Z 2025-05-07T19:45:10.9223376Z 2025-05-07T19:45:10.9223383Z 2025-05-07T19:45:10.9223386Z 2025-05-07T19:45:10.9223580Z  2025-05-07T19:45:10.9223801Z 2025-05-07T19:45:10.9223805Z 2025-05-07T19:45:10.9223808Z 2025-05-07T19:45:10.9223812Z 2025-05-07T19:45:10.9223815Z 2025-05-07T19:45:10.9223818Z 2025-05-07T19:45:10.9223822Z 2025-05-07T19:45:10.9223829Z 2025-05-07T19:45:10.9223832Z 2025-05-07T19:45:10.9223836Z 2025-05-07T19:45:10.9223839Z 2025-05-07T19:45:10.9223842Z 2025-05-07T19:45:10.9223846Z 2025-05-07T19:45:10.9223849Z 2025-05-07T19:45:10.9223853Z 2025-05-07T19:45:10.9223882Z 2025-05-07T19:45:10.9224045Z  2025-05-07T19:45:10.9224273Z 2025-05-07T19:45:10.9224277Z 2025-05-07T19:45:10.9224280Z 2025-05-07T19:45:10.9224283Z 2025-05-07T19:45:10.9224287Z 2025-05-07T19:45:10.9224290Z 2025-05-07T19:45:10.9224293Z 2025-05-07T19:45:10.9224297Z 2025-05-07T19:45:10.9224300Z 2025-05-07T19:45:10.9224303Z 2025-05-07T19:45:10.9224333Z 2025-05-07T19:45:10.9224407Z 2025-05-07T19:45:10.9224410Z 2025-05-07T19:45:10.9224414Z 2025-05-07T19:45:10.9224417Z 2025-05-07T19:45:10.9224421Z 2025-05-07T19:45:10.9224424Z 2025-05-07T19:45:10.9224600Z  2025-05-07T19:45:10.9224840Z 2025-05-07T19:45:10.9224844Z 2025-05-07T19:45:10.9224847Z 2025-05-07T19:45:10.9224932Z 2025-05-07T19:45:10.9224936Z 2025-05-07T19:45:10.9224939Z 2025-05-07T19:45:10.9224943Z 2025-05-07T19:45:10.9224946Z 2025-05-07T19:45:10.9224950Z 2025-05-07T19:45:10.9224954Z 2025-05-07T19:45:10.9224957Z 2025-05-07T19:45:10.9224961Z 2025-05-07T19:45:10.9224964Z 2025-05-07T19:45:10.9224967Z 2025-05-07T19:45:10.9224971Z 2025-05-07T19:45:10.9224974Z 2025-05-07T19:45:10.9224977Z 2025-05-07T19:45:10.9224981Z 2025-05-07T19:45:10.9225162Z  2025-05-07T19:45:10.9225423Z 2025-05-07T19:45:10.9225426Z 2025-05-07T19:45:10.9225670Z  2025-05-07T19:45:10.9225787Z 2025-05-07T19:45:10.9225791Z 2025-05-07T19:45:10.9225938Z  2025-05-07T19:45:10.9226074Z 2025-05-07T19:45:10.9226078Z 2025-05-07T19:45:10.9226081Z 2025-05-07T19:45:10.9226204Z  2025-05-07T19:45:10.9226357Z 2025-05-07T19:45:10.9226361Z 2025-05-07T19:45:10.9226364Z 2025-05-07T19:45:10.9226368Z 2025-05-07T19:45:10.9226484Z  2025-05-07T19:45:10.9226619Z 2025-05-07T19:45:10.9226625Z 2025-05-07T19:45:10.9226629Z 2025-05-07T19:45:10.9226632Z 2025-05-07T19:45:10.9226665Z 2025-05-07T19:45:10.9226784Z  2025-05-07T19:45:10.9226920Z 2025-05-07T19:45:10.9226924Z 2025-05-07T19:45:10.9226927Z 2025-05-07T19:45:10.9226931Z 2025-05-07T19:45:10.9226934Z 2025-05-07T19:45:10.9226937Z 2025-05-07T19:45:10.9227087Z  2025-05-07T19:45:10.9227229Z 2025-05-07T19:45:10.9227232Z 2025-05-07T19:45:10.9227235Z 2025-05-07T19:45:10.9227239Z 2025-05-07T19:45:10.9227242Z 2025-05-07T19:45:10.9227246Z 2025-05-07T19:45:10.9227249Z 2025-05-07T19:45:10.9227376Z  2025-05-07T19:45:10.9227561Z 2025-05-07T19:45:10.9227565Z 2025-05-07T19:45:10.9227569Z 2025-05-07T19:45:10.9227572Z 2025-05-07T19:45:10.9227575Z 2025-05-07T19:45:10.9227579Z 2025-05-07T19:45:10.9227582Z 2025-05-07T19:45:10.9227585Z 2025-05-07T19:45:10.9227716Z  2025-05-07T19:45:10.9227908Z 2025-05-07T19:45:10.9227912Z 2025-05-07T19:45:10.9227919Z 2025-05-07T19:45:10.9227922Z 2025-05-07T19:45:10.9227926Z 2025-05-07T19:45:10.9227929Z 2025-05-07T19:45:10.9227932Z 2025-05-07T19:45:10.9227936Z 2025-05-07T19:45:10.9227939Z 2025-05-07T19:45:10.9228071Z  2025-05-07T19:45:10.9228245Z 2025-05-07T19:45:10.9228275Z 2025-05-07T19:45:10.9228278Z 2025-05-07T19:45:10.9228282Z 2025-05-07T19:45:10.9228285Z 2025-05-07T19:45:10.9228289Z 2025-05-07T19:45:10.9228292Z 2025-05-07T19:45:10.9228295Z 2025-05-07T19:45:10.9228299Z 2025-05-07T19:45:10.9228302Z 2025-05-07T19:45:10.9228446Z  2025-05-07T19:45:10.9228628Z 2025-05-07T19:45:10.9228660Z 2025-05-07T19:45:10.9228666Z 2025-05-07T19:45:10.9228670Z 2025-05-07T19:45:10.9228673Z 2025-05-07T19:45:10.9228677Z 2025-05-07T19:45:10.9228680Z 2025-05-07T19:45:10.9228683Z 2025-05-07T19:45:10.9228687Z 2025-05-07T19:45:10.9228690Z 2025-05-07T19:45:10.9228694Z 2025-05-07T19:45:10.9228836Z  2025-05-07T19:45:10.9229032Z 2025-05-07T19:45:10.9229066Z 2025-05-07T19:45:10.9229070Z 2025-05-07T19:45:10.9229073Z 2025-05-07T19:45:10.9229077Z 2025-05-07T19:45:10.9229080Z 2025-05-07T19:45:10.9229083Z 2025-05-07T19:45:10.9229087Z 2025-05-07T19:45:10.9229090Z 2025-05-07T19:45:10.9229094Z 2025-05-07T19:45:10.9229097Z 2025-05-07T19:45:10.9229101Z 2025-05-07T19:45:10.9229245Z  2025-05-07T19:45:10.9229624Z 2025-05-07T19:45:10.9229628Z 2025-05-07T19:45:10.9229631Z 2025-05-07T19:45:10.9229635Z 2025-05-07T19:45:10.9229638Z 2025-05-07T19:45:10.9229642Z 2025-05-07T19:45:10.9229645Z 2025-05-07T19:45:10.9229649Z 2025-05-07T19:45:10.9229728Z 2025-05-07T19:45:10.9229731Z 2025-05-07T19:45:10.9229735Z 2025-05-07T19:45:10.9229739Z 2025-05-07T19:45:10.9229742Z 2025-05-07T19:45:10.9229900Z  2025-05-07T19:45:10.9230152Z 2025-05-07T19:45:10.9230155Z 2025-05-07T19:45:10.9230159Z 2025-05-07T19:45:10.9230162Z 2025-05-07T19:45:10.9230166Z 2025-05-07T19:45:10.9230232Z 2025-05-07T19:45:10.9230236Z 2025-05-07T19:45:10.9230240Z 2025-05-07T19:45:10.9230244Z 2025-05-07T19:45:10.9230247Z 2025-05-07T19:45:10.9230251Z 2025-05-07T19:45:10.9230254Z 2025-05-07T19:45:10.9230258Z 2025-05-07T19:45:10.9230261Z 2025-05-07T19:45:10.9230415Z  2025-05-07T19:45:10.9230663Z 2025-05-07T19:45:10.9230667Z 2025-05-07T19:45:10.9230670Z 2025-05-07T19:45:10.9230673Z 2025-05-07T19:45:10.9230677Z 2025-05-07T19:45:10.9230680Z 2025-05-07T19:45:10.9230684Z 2025-05-07T19:45:10.9230687Z 2025-05-07T19:45:10.9230690Z 2025-05-07T19:45:10.9230694Z 2025-05-07T19:45:10.9230697Z 2025-05-07T19:45:10.9230705Z 2025-05-07T19:45:10.9230708Z 2025-05-07T19:45:10.9230711Z 2025-05-07T19:45:10.9230715Z 2025-05-07T19:45:10.9230906Z  2025-05-07T19:45:10.9231127Z 2025-05-07T19:45:10.9231131Z 2025-05-07T19:45:10.9231134Z 2025-05-07T19:45:10.9231138Z 2025-05-07T19:45:10.9231141Z 2025-05-07T19:45:10.9231147Z 2025-05-07T19:45:10.9231151Z 2025-05-07T19:45:10.9231154Z 2025-05-07T19:45:10.9231157Z 2025-05-07T19:45:10.9231161Z 2025-05-07T19:45:10.9231164Z 2025-05-07T19:45:10.9231168Z 2025-05-07T19:45:10.9231171Z 2025-05-07T19:45:10.9231175Z 2025-05-07T19:45:10.9231202Z 2025-05-07T19:45:10.9231206Z 2025-05-07T19:45:10.9231365Z  2025-05-07T19:45:10.9231596Z 2025-05-07T19:45:10.9231599Z 2025-05-07T19:45:10.9231603Z 2025-05-07T19:45:10.9231606Z 2025-05-07T19:45:10.9231610Z 2025-05-07T19:45:10.9231613Z 2025-05-07T19:45:10.9231616Z 2025-05-07T19:45:10.9231620Z 2025-05-07T19:45:10.9231647Z 2025-05-07T19:45:10.9231655Z 2025-05-07T19:45:10.9231658Z 2025-05-07T19:45:10.9231662Z 2025-05-07T19:45:10.9231665Z 2025-05-07T19:45:10.9231668Z 2025-05-07T19:45:10.9231672Z 2025-05-07T19:45:10.9231675Z 2025-05-07T19:45:10.9231678Z 2025-05-07T19:45:10.9231845Z  2025-05-07T19:45:10.9232078Z 2025-05-07T19:45:10.9232085Z 2025-05-07T19:45:10.9232111Z 2025-05-07T19:45:10.9232114Z 2025-05-07T19:45:10.9232118Z 2025-05-07T19:45:10.9232121Z 2025-05-07T19:45:10.9232125Z 2025-05-07T19:45:10.9232128Z 2025-05-07T19:45:10.9232131Z 2025-05-07T19:45:10.9232135Z 2025-05-07T19:45:10.9232138Z 2025-05-07T19:45:10.9232142Z 2025-05-07T19:45:10.9232145Z 2025-05-07T19:45:10.9232149Z 2025-05-07T19:45:10.9232152Z 2025-05-07T19:45:10.9232155Z 2025-05-07T19:45:10.9232159Z 2025-05-07T19:45:10.9232162Z 2025-05-07T19:45:10.9232338Z  2025-05-07T19:45:10.9232597Z 2025-05-07T19:45:10.9232601Z 2025-05-07T19:45:10.9232711Z  2025-05-07T19:45:10.9232826Z 2025-05-07T19:45:10.9232830Z 2025-05-07T19:45:10.9232967Z  2025-05-07T19:45:10.9233088Z 2025-05-07T19:45:10.9233091Z 2025-05-07T19:45:10.9233095Z 2025-05-07T19:45:10.9233205Z  2025-05-07T19:45:10.9233355Z 2025-05-07T19:45:10.9233358Z 2025-05-07T19:45:10.9233362Z 2025-05-07T19:45:10.9233365Z 2025-05-07T19:45:10.9233482Z  2025-05-07T19:45:10.9233610Z 2025-05-07T19:45:10.9233614Z 2025-05-07T19:45:10.9233618Z 2025-05-07T19:45:10.9233621Z 2025-05-07T19:45:10.9233647Z 2025-05-07T19:45:10.9233761Z  2025-05-07T19:45:10.9233900Z 2025-05-07T19:45:10.9233903Z 2025-05-07T19:45:10.9233907Z 2025-05-07T19:45:10.9233910Z 2025-05-07T19:45:10.9233913Z 2025-05-07T19:45:10.9233917Z 2025-05-07T19:45:10.9234061Z  2025-05-07T19:45:10.9234203Z 2025-05-07T19:45:10.9234207Z 2025-05-07T19:45:10.9234210Z 2025-05-07T19:45:10.9234213Z 2025-05-07T19:45:10.9234217Z 2025-05-07T19:45:10.9234220Z 2025-05-07T19:45:10.9234299Z 2025-05-07T19:45:10.9234422Z  2025-05-07T19:45:10.9234597Z 2025-05-07T19:45:10.9234601Z 2025-05-07T19:45:10.9234604Z 2025-05-07T19:45:10.9234608Z 2025-05-07T19:45:10.9234612Z 2025-05-07T19:45:10.9234615Z 2025-05-07T19:45:10.9234618Z 2025-05-07T19:45:10.9234622Z 2025-05-07T19:45:10.9234747Z  2025-05-07T19:45:10.9234993Z 2025-05-07T19:45:10.9234997Z 2025-05-07T19:45:10.9235000Z 2025-05-07T19:45:10.9235004Z 2025-05-07T19:45:10.9235007Z 2025-05-07T19:45:10.9235010Z 2025-05-07T19:45:10.9235014Z 2025-05-07T19:45:10.9235017Z 2025-05-07T19:45:10.9235021Z 2025-05-07T19:45:10.9235153Z  2025-05-07T19:45:10.9235325Z 2025-05-07T19:45:10.9235351Z 2025-05-07T19:45:10.9235355Z 2025-05-07T19:45:10.9235358Z 2025-05-07T19:45:10.9235362Z 2025-05-07T19:45:10.9235365Z 2025-05-07T19:45:10.9235369Z 2025-05-07T19:45:10.9235373Z 2025-05-07T19:45:10.9235376Z 2025-05-07T19:45:10.9235380Z 2025-05-07T19:45:10.9235512Z  2025-05-07T19:45:10.9235697Z 2025-05-07T19:45:10.9235725Z 2025-05-07T19:45:10.9235729Z 2025-05-07T19:45:10.9235732Z 2025-05-07T19:45:10.9235736Z 2025-05-07T19:45:10.9235739Z 2025-05-07T19:45:10.9235742Z 2025-05-07T19:45:10.9235746Z 2025-05-07T19:45:10.9235749Z 2025-05-07T19:45:10.9235753Z 2025-05-07T19:45:10.9235756Z 2025-05-07T19:45:10.9235898Z  2025-05-07T19:45:10.9236087Z 2025-05-07T19:45:10.9236115Z 2025-05-07T19:45:10.9236119Z 2025-05-07T19:45:10.9236122Z 2025-05-07T19:45:10.9236126Z 2025-05-07T19:45:10.9236129Z 2025-05-07T19:45:10.9236133Z 2025-05-07T19:45:10.9236136Z 2025-05-07T19:45:10.9236140Z 2025-05-07T19:45:10.9236143Z 2025-05-07T19:45:10.9236146Z 2025-05-07T19:45:10.9236150Z 2025-05-07T19:45:10.9236291Z  2025-05-07T19:45:10.9236511Z 2025-05-07T19:45:10.9236515Z 2025-05-07T19:45:10.9236519Z 2025-05-07T19:45:10.9236522Z 2025-05-07T19:45:10.9236525Z 2025-05-07T19:45:10.9236529Z 2025-05-07T19:45:10.9236536Z 2025-05-07T19:45:10.9236539Z 2025-05-07T19:45:10.9236543Z 2025-05-07T19:45:10.9236547Z 2025-05-07T19:45:10.9236550Z 2025-05-07T19:45:10.9236554Z 2025-05-07T19:45:10.9236557Z 2025-05-07T19:45:10.9236702Z  2025-05-07T19:45:10.9236940Z 2025-05-07T19:45:10.9236944Z 2025-05-07T19:45:10.9236947Z 2025-05-07T19:45:10.9236954Z 2025-05-07T19:45:10.9236958Z 2025-05-07T19:45:10.9236961Z 2025-05-07T19:45:10.9236965Z 2025-05-07T19:45:10.9236968Z 2025-05-07T19:45:10.9236971Z 2025-05-07T19:45:10.9236975Z 2025-05-07T19:45:10.9236978Z 2025-05-07T19:45:10.9236981Z 2025-05-07T19:45:10.9236985Z 2025-05-07T19:45:10.9236988Z 2025-05-07T19:45:10.9237139Z  2025-05-07T19:45:10.9237375Z 2025-05-07T19:45:10.9237379Z 2025-05-07T19:45:10.9237382Z 2025-05-07T19:45:10.9237386Z 2025-05-07T19:45:10.9237389Z 2025-05-07T19:45:10.9237392Z 2025-05-07T19:45:10.9237396Z 2025-05-07T19:45:10.9237399Z 2025-05-07T19:45:10.9237406Z 2025-05-07T19:45:10.9237410Z 2025-05-07T19:45:10.9237413Z 2025-05-07T19:45:10.9237417Z 2025-05-07T19:45:10.9237420Z 2025-05-07T19:45:10.9237424Z 2025-05-07T19:45:10.9237427Z 2025-05-07T19:45:10.9237684Z  2025-05-07T19:45:10.9237904Z 2025-05-07T19:45:10.9237908Z 2025-05-07T19:45:10.9237911Z 2025-05-07T19:45:10.9237918Z 2025-05-07T19:45:10.9237921Z 2025-05-07T19:45:10.9237925Z 2025-05-07T19:45:10.9237928Z 2025-05-07T19:45:10.9237931Z 2025-05-07T19:45:10.9237935Z 2025-05-07T19:45:10.9237938Z 2025-05-07T19:45:10.9237941Z 2025-05-07T19:45:10.9237945Z 2025-05-07T19:45:10.9237948Z 2025-05-07T19:45:10.9237975Z 2025-05-07T19:45:10.9237978Z 2025-05-07T19:45:10.9237981Z 2025-05-07T19:45:10.9238142Z  2025-05-07T19:45:10.9238366Z 2025-05-07T19:45:10.9238369Z 2025-05-07T19:45:10.9238373Z 2025-05-07T19:45:10.9238376Z 2025-05-07T19:45:10.9238379Z 2025-05-07T19:45:10.9238383Z 2025-05-07T19:45:10.9238386Z 2025-05-07T19:45:10.9238531Z 2025-05-07T19:45:10.9238535Z 2025-05-07T19:45:10.9238538Z 2025-05-07T19:45:10.9238542Z 2025-05-07T19:45:10.9238545Z 2025-05-07T19:45:10.9238549Z 2025-05-07T19:45:10.9238552Z 2025-05-07T19:45:10.9238556Z 2025-05-07T19:45:10.9238559Z 2025-05-07T19:45:10.9238562Z 2025-05-07T19:45:10.9238794Z  2025-05-07T19:45:10.9239024Z 2025-05-07T19:45:10.9239054Z 2025-05-07T19:45:10.9239057Z 2025-05-07T19:45:10.9239061Z 2025-05-07T19:45:10.9239064Z 2025-05-07T19:45:10.9239067Z 2025-05-07T19:45:10.9239071Z 2025-05-07T19:45:10.9239074Z 2025-05-07T19:45:10.9239077Z 2025-05-07T19:45:10.9239081Z 2025-05-07T19:45:10.9239084Z 2025-05-07T19:45:10.9239087Z 2025-05-07T19:45:10.9239091Z 2025-05-07T19:45:10.9239094Z 2025-05-07T19:45:10.9239097Z 2025-05-07T19:45:10.9239101Z 2025-05-07T19:45:10.9239104Z 2025-05-07T19:45:10.9239108Z 2025-05-07T19:45:10.9239287Z  2025-05-07T19:45:10.9239550Z 2025-05-07T19:45:10.9239553Z 2025-05-07T19:45:10.9239664Z  2025-05-07T19:45:10.9239784Z 2025-05-07T19:45:10.9239787Z 2025-05-07T19:45:10.9239923Z  2025-05-07T19:45:10.9240043Z 2025-05-07T19:45:10.9240047Z 2025-05-07T19:45:10.9240050Z 2025-05-07T19:45:10.9240163Z  2025-05-07T19:45:10.9240320Z 2025-05-07T19:45:10.9240327Z 2025-05-07T19:45:10.9240331Z 2025-05-07T19:45:10.9240334Z 2025-05-07T19:45:10.9240452Z  2025-05-07T19:45:10.9240585Z 2025-05-07T19:45:10.9240588Z 2025-05-07T19:45:10.9240592Z 2025-05-07T19:45:10.9240622Z 2025-05-07T19:45:10.9240625Z 2025-05-07T19:45:10.9240745Z  2025-05-07T19:45:10.9240884Z 2025-05-07T19:45:10.9240887Z 2025-05-07T19:45:10.9240891Z 2025-05-07T19:45:10.9240895Z 2025-05-07T19:45:10.9240898Z 2025-05-07T19:45:10.9240902Z 2025-05-07T19:45:10.9241055Z  2025-05-07T19:45:10.9241196Z 2025-05-07T19:45:10.9241200Z 2025-05-07T19:45:10.9241203Z 2025-05-07T19:45:10.9241207Z 2025-05-07T19:45:10.9241214Z 2025-05-07T19:45:10.9241217Z 2025-05-07T19:45:10.9241221Z 2025-05-07T19:45:10.9241345Z  2025-05-07T19:45:10.9241524Z 2025-05-07T19:45:10.9241528Z 2025-05-07T19:45:10.9241531Z 2025-05-07T19:45:10.9241534Z 2025-05-07T19:45:10.9241538Z 2025-05-07T19:45:10.9241541Z 2025-05-07T19:45:10.9241544Z 2025-05-07T19:45:10.9241551Z 2025-05-07T19:45:10.9241679Z  2025-05-07T19:45:10.9241869Z 2025-05-07T19:45:10.9241873Z 2025-05-07T19:45:10.9241876Z 2025-05-07T19:45:10.9241879Z 2025-05-07T19:45:10.9241883Z 2025-05-07T19:45:10.9241886Z 2025-05-07T19:45:10.9241890Z 2025-05-07T19:45:10.9241893Z 2025-05-07T19:45:10.9241897Z 2025-05-07T19:45:10.9242030Z  2025-05-07T19:45:10.9242230Z 2025-05-07T19:45:10.9242233Z 2025-05-07T19:45:10.9242237Z 2025-05-07T19:45:10.9242240Z 2025-05-07T19:45:10.9242244Z 2025-05-07T19:45:10.9242247Z 2025-05-07T19:45:10.9242250Z 2025-05-07T19:45:10.9242254Z 2025-05-07T19:45:10.9242257Z 2025-05-07T19:45:10.9242264Z 2025-05-07T19:45:10.9242403Z  2025-05-07T19:45:10.9242608Z 2025-05-07T19:45:10.9242612Z 2025-05-07T19:45:10.9242615Z 2025-05-07T19:45:10.9242619Z 2025-05-07T19:45:10.9242622Z 2025-05-07T19:45:10.9242625Z 2025-05-07T19:45:10.9242629Z 2025-05-07T19:45:10.9242632Z 2025-05-07T19:45:10.9242636Z 2025-05-07T19:45:10.9242642Z 2025-05-07T19:45:10.9242645Z 2025-05-07T19:45:10.9242786Z  2025-05-07T19:45:10.9243006Z 2025-05-07T19:45:10.9243010Z 2025-05-07T19:45:10.9243013Z 2025-05-07T19:45:10.9243017Z 2025-05-07T19:45:10.9243021Z 2025-05-07T19:45:10.9243024Z 2025-05-07T19:45:10.9243028Z 2025-05-07T19:45:10.9243031Z 2025-05-07T19:45:10.9243034Z 2025-05-07T19:45:10.9243038Z 2025-05-07T19:45:10.9243041Z 2025-05-07T19:45:10.9243044Z 2025-05-07T19:45:10.9243186Z  2025-05-07T19:45:10.9243416Z 2025-05-07T19:45:10.9243420Z 2025-05-07T19:45:10.9243423Z 2025-05-07T19:45:10.9243426Z 2025-05-07T19:45:10.9243491Z 2025-05-07T19:45:10.9243495Z 2025-05-07T19:45:10.9243498Z 2025-05-07T19:45:10.9243502Z 2025-05-07T19:45:10.9243505Z 2025-05-07T19:45:10.9243508Z 2025-05-07T19:45:10.9243512Z 2025-05-07T19:45:10.9243515Z 2025-05-07T19:45:10.9243519Z 2025-05-07T19:45:10.9243668Z  2025-05-07T19:45:10.9243958Z 2025-05-07T19:45:10.9243962Z 2025-05-07T19:45:10.9243966Z 2025-05-07T19:45:10.9243969Z 2025-05-07T19:45:10.9243972Z 2025-05-07T19:45:10.9243976Z 2025-05-07T19:45:10.9243979Z 2025-05-07T19:45:10.9243983Z 2025-05-07T19:45:10.9243986Z 2025-05-07T19:45:10.9243989Z 2025-05-07T19:45:10.9243993Z 2025-05-07T19:45:10.9243996Z 2025-05-07T19:45:10.9243999Z 2025-05-07T19:45:10.9244003Z 2025-05-07T19:45:10.9244156Z  2025-05-07T19:45:10.9244400Z 2025-05-07T19:45:10.9244403Z 2025-05-07T19:45:10.9244407Z 2025-05-07T19:45:10.9244410Z 2025-05-07T19:45:10.9244413Z 2025-05-07T19:45:10.9244417Z 2025-05-07T19:45:10.9244424Z 2025-05-07T19:45:10.9244428Z 2025-05-07T19:45:10.9244431Z 2025-05-07T19:45:10.9244435Z 2025-05-07T19:45:10.9244438Z 2025-05-07T19:45:10.9244442Z 2025-05-07T19:45:10.9244445Z 2025-05-07T19:45:10.9244448Z 2025-05-07T19:45:10.9244452Z 2025-05-07T19:45:10.9244641Z  2025-05-07T19:45:10.9244866Z 2025-05-07T19:45:10.9244870Z 2025-05-07T19:45:10.9244873Z 2025-05-07T19:45:10.9244877Z 2025-05-07T19:45:10.9244880Z 2025-05-07T19:45:10.9244883Z 2025-05-07T19:45:10.9244887Z 2025-05-07T19:45:10.9244890Z 2025-05-07T19:45:10.9244894Z 2025-05-07T19:45:10.9244897Z 2025-05-07T19:45:10.9244900Z 2025-05-07T19:45:10.9244904Z 2025-05-07T19:45:10.9244907Z 2025-05-07T19:45:10.9244938Z 2025-05-07T19:45:10.9244941Z 2025-05-07T19:45:10.9244945Z 2025-05-07T19:45:10.9245113Z  2025-05-07T19:45:10.9245340Z 2025-05-07T19:45:10.9245344Z 2025-05-07T19:45:10.9245347Z 2025-05-07T19:45:10.9245351Z 2025-05-07T19:45:10.9245357Z 2025-05-07T19:45:10.9245361Z 2025-05-07T19:45:10.9245366Z 2025-05-07T19:45:10.9245396Z 2025-05-07T19:45:10.9245400Z 2025-05-07T19:45:10.9245403Z 2025-05-07T19:45:10.9245406Z 2025-05-07T19:45:10.9245410Z 2025-05-07T19:45:10.9245413Z 2025-05-07T19:45:10.9245416Z 2025-05-07T19:45:10.9245420Z 2025-05-07T19:45:10.9245423Z 2025-05-07T19:45:10.9245429Z 2025-05-07T19:45:10.9245600Z  2025-05-07T19:45:10.9245835Z 2025-05-07T19:45:10.9245864Z 2025-05-07T19:45:10.9245868Z 2025-05-07T19:45:10.9245871Z 2025-05-07T19:45:10.9245874Z 2025-05-07T19:45:10.9245878Z 2025-05-07T19:45:10.9245881Z 2025-05-07T19:45:10.9245884Z 2025-05-07T19:45:10.9245888Z 2025-05-07T19:45:10.9245891Z 2025-05-07T19:45:10.9245894Z 2025-05-07T19:45:10.9245898Z 2025-05-07T19:45:10.9245901Z 2025-05-07T19:45:10.9245905Z 2025-05-07T19:45:10.9245908Z 2025-05-07T19:45:10.9245911Z 2025-05-07T19:45:10.9245915Z 2025-05-07T19:45:10.9245919Z 2025-05-07T19:45:10.9246100Z  2025-05-07T19:45:10.9246377Z 2025-05-07T19:45:10.9246381Z 2025-05-07T19:45:10.9246490Z  2025-05-07T19:45:10.9246613Z 2025-05-07T19:45:10.9246617Z 2025-05-07T19:45:10.9246759Z  2025-05-07T19:45:10.9246886Z 2025-05-07T19:45:10.9246890Z 2025-05-07T19:45:10.9246893Z 2025-05-07T19:45:10.9247012Z  2025-05-07T19:45:10.9247167Z 2025-05-07T19:45:10.9247170Z 2025-05-07T19:45:10.9247175Z 2025-05-07T19:45:10.9247179Z 2025-05-07T19:45:10.9247292Z  2025-05-07T19:45:10.9247426Z 2025-05-07T19:45:10.9247430Z 2025-05-07T19:45:10.9247433Z 2025-05-07T19:45:10.9247467Z 2025-05-07T19:45:10.9247470Z 2025-05-07T19:45:10.9247596Z  done 2025-05-07T19:45:11.2394884Z Preparing transaction: \ | / done 2025-05-07T19:45:14.9782704Z Verifying transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:17.6941816Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:18.1152875Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:19.9750807Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:19.9751548Z 2025-05-07T19:45:19.9760305Z 2025-05-07T19:45:19.9786556Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:22.2977448Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:22.2982264Z 2025-05-07T19:45:22.2982411Z Collecting build 2025-05-07T19:45:22.2982831Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:22.2983661Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build) (25.0) 2025-05-07T19:45:22.2984565Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:22.2985269Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:22.2985828Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:22.2986308Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:22.2986804Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:22.2987084Z 2025-05-07T19:45:22.2987298Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:22.2987644Z 2025-05-07T19:45:24.1622909Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:24.1623272Z 2025-05-07T19:45:24.2196894Z [CHECK] Binary make found in PATH 2025-05-07T19:45:25.9916892Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:25.9917365Z 2025-05-07T19:45:26.0505710Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:27.8312316Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:27.8313228Z 2025-05-07T19:45:27.9060985Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:29.7912702Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:31.8163844Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:33.7371596Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:35.7708553Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:37.6263989Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:37.6265651Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:37.6341768Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:37.6342277Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:37.6342939Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:37.6343361Z env: 2025-05-07T19:45:37.6343596Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:37.6343938Z BUILD_ENV: build_binary 2025-05-07T19:45:37.6344216Z BUILD_TARGET: default 2025-05-07T19:45:37.6344479Z BUILD_VARIANT: cuda 2025-05-07T19:45:37.6344724Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:37.6345006Z ##[endgroup] 2025-05-07T19:45:38.0931846Z ################################################################################ 2025-05-07T19:45:38.0932917Z # Install CUDA 2025-05-07T19:45:38.0933553Z # 2025-05-07T19:45:38.0948870Z # [2025-05-07T19:45:38.094Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:38.0950367Z ################################################################################ 2025-05-07T19:45:38.0951198Z 2025-05-07T19:45:38.0965935Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:38.1903467Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:38.1904597Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:38.1907291Z + conda clean --packages --tarball -y 2025-05-07T19:45:38.1907916Z 2025-05-07T19:45:38.7199581Z Will remove 133 (497.9 MB) tarball(s). 2025-05-07T19:45:38.7200508Z Will remove 16 (100.0 MB) package(s). 2025-05-07T19:45:38.7776667Z 2025-05-07T19:45:38.7782106Z + conda clean --all -y 2025-05-07T19:45:38.7782624Z 2025-05-07T19:45:39.4176606Z There are no unused tarball(s) to remove. 2025-05-07T19:45:39.4177047Z Will remove 1 index cache(s). 2025-05-07T19:45:39.4177418Z There are no unused package(s) to remove. 2025-05-07T19:45:39.4177911Z There are no tempfile(s) to remove. 2025-05-07T19:45:39.4178238Z There are no logfile(s) to remove. 2025-05-07T19:45:39.4751631Z 2025-05-07T19:45:39.4759017Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:39.4787229Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:40.3260650Z Channels: 2025-05-07T19:45:40.3261374Z - conda-forge 2025-05-07T19:45:40.3262122Z Platform: linux-64 2025-05-07T19:45:50.0921920Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:51.5091810Z Solving environment: | / - \ done 2025-05-07T19:45:51.6338289Z 2025-05-07T19:45:51.6338574Z ## Package Plan ## 2025-05-07T19:45:51.6338765Z 2025-05-07T19:45:51.6339126Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:51.6339456Z 2025-05-07T19:45:51.6339581Z added / updated specs: 2025-05-07T19:45:51.6339919Z - cuda=12.6.3 2025-05-07T19:45:51.6340239Z 2025-05-07T19:45:51.6340243Z 2025-05-07T19:45:51.6340385Z The following packages will be downloaded: 2025-05-07T19:45:51.6340654Z 2025-05-07T19:45:51.6340817Z package | build 2025-05-07T19:45:51.6341196Z ---------------------------|----------------- 2025-05-07T19:45:51.6341587Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:45:51.6342094Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:45:51.6342708Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:45:51.6343197Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:45:51.6343698Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:45:51.6344241Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:45:51.6344809Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:45:51.6345321Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:45:51.6346211Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:45:51.6346717Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:51.6347243Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:51.6347829Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:45:51.6348379Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:51.6348968Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:45:51.6349665Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:45:51.6350407Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:45:51.6350957Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:45:51.6351468Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:45:51.6352019Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:45:51.6352543Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:51.6353273Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:45:51.6353805Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:45:51.6354342Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:51.6354905Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:51.6355429Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:45:51.6355930Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:45:51.6356537Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:45:51.6357067Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:45:51.6377589Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:45:51.6378175Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:45:51.6378733Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:45:51.6379208Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:45:51.6379709Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:45:51.6380242Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:45:51.6380708Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:45:51.6381184Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:45:51.6381650Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:45:51.6382164Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:45:51.6382670Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:45:51.6383189Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:45:51.6383679Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:45:51.6384132Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:45:51.6385082Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:45:51.6385650Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:45:51.6386209Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:51.6386737Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:45:51.6387535Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:51.6388053Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:51.6388538Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:45:51.6389068Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:51.6389688Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:45:51.6390187Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:45:51.6390675Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:45:51.6391093Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:51.6391559Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:45:51.6392011Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:45:51.6392455Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:51.6392891Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:45:51.6393386Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:45:51.6394037Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:45:51.6394529Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:45:51.6395034Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:45:51.6395524Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:45:51.6396072Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:45:51.6396578Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:45:51.6397124Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:45:51.6397633Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:45:51.6398176Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:45:51.6398694Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:45:51.6399231Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:45:51.6399775Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:45:51.6400276Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:45:51.6400763Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:45:51.6401213Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:45:51.6401709Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:45:51.6402276Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:45:51.6402742Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:45:51.6403240Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:45:51.6403723Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:45:51.6404232Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:45:51.6404701Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:45:51.6405184Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:45:51.6405679Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:45:51.6406130Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:45:51.6406676Z libxkbcommon-1.9.2 | h65c71a3_0 660 KB conda-forge 2025-05-07T19:45:51.6407402Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:45:51.6407899Z libxml2-2.13.8 | h4bc477f_0 675 KB conda-forge 2025-05-07T19:45:51.6408350Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:45:51.6408862Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:45:51.6409382Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:45:51.6409805Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:45:51.6410263Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:45:51.6410757Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:45:51.6411278Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:45:51.6411733Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:45:51.6412211Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:45:51.6412715Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:45:51.6413301Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:45:51.6413830Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:45:51.6414361Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:45:51.6414890Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:45:51.6415417Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:45:51.6415965Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:45:51.6416539Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:45:51.6417011Z ------------------------------------------------------------ 2025-05-07T19:45:51.6417418Z Total: 1.59 GB 2025-05-07T19:45:51.6417651Z 2025-05-07T19:45:51.6417805Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:51.6418081Z 2025-05-07T19:45:51.6418275Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:45:51.6418775Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:45:51.6419288Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:45:51.6419897Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:45:51.6420384Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:45:51.6421027Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:45:51.6421659Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:45:51.6422224Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:51.6422829Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:45:51.6423373Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:45:51.6423937Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:45:51.6424564Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:51.6425191Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:45:51.6425852Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:51.6426480Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:51.6427087Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:45:51.6428717Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:45:51.6429357Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:45:51.6430120Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:45:51.6430736Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:45:51.6431466Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:51.6432091Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:45:51.6432640Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:45:51.6433295Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:45:51.6433909Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:45:51.6434487Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:45:51.6435097Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:45:51.6435725Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:45:51.6436473Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:45:51.6437127Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:45:51.6437737Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:45:51.6438336Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:45:51.6438894Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:45:51.6439480Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:45:51.6440027Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:45:51.6440617Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:45:51.6441194Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:45:51.6441773Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:51.6442420Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:45:51.6443053Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:45:51.6443731Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:45:51.6444274Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:45:51.6444843Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:45:51.6445490Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:45:51.6446091Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:45:51.6446715Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:45:51.6447345Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:45:51.6447875Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:45:51.6448412Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:45:51.6448987Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:45:51.6449603Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:45:51.6450117Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:45:51.6450555Z expat conda-forge/linux-64::expat-2.7.0-h5888daf_0 2025-05-07T19:45:51.6451008Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:45:51.6451544Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:45:51.6452037Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:45:51.6452477Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:45:51.6452914Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:45:51.6453441Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:45:51.6453992Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:45:51.6454571Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:45:51.6455126Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:45:51.6455661Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:45:51.6456228Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:45:51.6456876Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:45:51.6457424Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:45:51.6457986Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:45:51.6458592Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:45:51.6459169Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:45:51.6459727Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:45:51.6460311Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:45:51.6460867Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:45:51.6461344Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:45:51.6461814Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:45:51.6462300Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:45:51.6462997Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:45:51.6463531Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:45:51.6464108Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:45:51.6464711Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:45:51.6465301Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:45:51.6465896Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:45:51.6466652Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:45:51.6467219Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:45:51.6467757Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:45:51.6468288Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.9.2-h65c71a3_0 2025-05-07T19:45:51.6468857Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:45:51.6469640Z libxml2 conda-forge/linux-64::libxml2-2.13.8-h4bc477f_0 2025-05-07T19:45:51.6470110Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:45:51.6470680Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:45:51.6471215Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:45:51.6471659Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:45:51.6472128Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:45:51.6472682Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:45:51.6473280Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:45:51.6473854Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:45:51.6474367Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:45:51.6474906Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:45:51.6475529Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:45:51.6476162Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:45:51.6476801Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:45:51.6477423Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:45:51.6477992Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:45:51.6478669Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:45:51.6479327Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:45:51.6479689Z 2025-05-07T19:45:51.6479745Z 2025-05-07T19:45:51.6479750Z 2025-05-07T19:45:51.6479941Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:51.6480370Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:45:51.6480640Z 2025-05-07T19:45:51.6481066Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:45:51.6481340Z 2025-05-07T19:45:51.6481344Z 2025-05-07T19:45:51.6481582Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:45:51.6481886Z 2025-05-07T19:45:51.6481890Z 2025-05-07T19:45:51.6481894Z 2025-05-07T19:45:51.6482262Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:45:51.6482544Z 2025-05-07T19:45:51.6482548Z 2025-05-07T19:45:51.6482551Z 2025-05-07T19:45:51.6482582Z 2025-05-07T19:45:51.6482830Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:45:51.6483121Z 2025-05-07T19:45:51.6483124Z 2025-05-07T19:45:51.6483128Z 2025-05-07T19:45:51.6483131Z 2025-05-07T19:45:51.6483134Z 2025-05-07T19:45:51.6483424Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:51.6483712Z 2025-05-07T19:45:51.6483715Z 2025-05-07T19:45:51.6483718Z 2025-05-07T19:45:51.6483722Z 2025-05-07T19:45:51.6483725Z 2025-05-07T19:45:51.6483732Z 2025-05-07T19:45:51.6483997Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:51.6484329Z 2025-05-07T19:45:51.6484332Z 2025-05-07T19:45:51.6484336Z 2025-05-07T19:45:51.6484339Z 2025-05-07T19:45:51.6484539Z 2025-05-07T19:45:51.6484545Z 2025-05-07T19:45:51.6484550Z 2025-05-07T19:45:51.6484994Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:51.6485345Z 2025-05-07T19:45:51.6485349Z 2025-05-07T19:45:51.6485352Z 2025-05-07T19:45:51.6485356Z 2025-05-07T19:45:51.6485359Z 2025-05-07T19:45:51.6485363Z 2025-05-07T19:45:51.6485366Z 2025-05-07T19:45:51.6485394Z 2025-05-07T19:45:51.6485672Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:45:51.6486006Z 2025-05-07T19:45:51.6486009Z 2025-05-07T19:45:51.6486013Z 2025-05-07T19:45:51.6486017Z 2025-05-07T19:45:51.6486020Z 2025-05-07T19:45:51.6486023Z 2025-05-07T19:45:51.6486027Z 2025-05-07T19:45:51.6486030Z 2025-05-07T19:45:51.6486037Z 2025-05-07T19:45:51.6486310Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:51.6486635Z 2025-05-07T19:45:51.6486639Z 2025-05-07T19:45:51.6486642Z 2025-05-07T19:45:51.6486645Z 2025-05-07T19:45:51.6486649Z 2025-05-07T19:45:51.6486652Z 2025-05-07T19:45:51.6486656Z 2025-05-07T19:45:51.6486659Z 2025-05-07T19:45:51.6486663Z 2025-05-07T19:45:51.6486666Z 2025-05-07T19:45:51.6486934Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:51.6487264Z 2025-05-07T19:45:51.6487268Z 2025-05-07T19:45:51.6487272Z 2025-05-07T19:45:51.6487275Z 2025-05-07T19:45:51.6487279Z 2025-05-07T19:45:51.6487282Z 2025-05-07T19:45:51.6487285Z 2025-05-07T19:45:51.6487421Z 2025-05-07T19:45:51.6487427Z 2025-05-07T19:45:51.6487430Z 2025-05-07T19:45:51.6487433Z 2025-05-07T19:45:51.6487734Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:45:51.6488092Z 2025-05-07T19:45:51.6488096Z 2025-05-07T19:45:51.6488099Z 2025-05-07T19:45:51.6488102Z 2025-05-07T19:45:51.6488106Z 2025-05-07T19:45:51.6488109Z 2025-05-07T19:45:51.6488113Z 2025-05-07T19:45:51.6488116Z 2025-05-07T19:45:51.6488119Z 2025-05-07T19:45:51.6488123Z 2025-05-07T19:45:51.6488126Z 2025-05-07T19:45:51.6488130Z 2025-05-07T19:45:51.6488459Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:45:51.6488772Z 2025-05-07T19:45:51.6488776Z 2025-05-07T19:45:51.6488779Z 2025-05-07T19:45:51.6488783Z 2025-05-07T19:45:51.6488786Z 2025-05-07T19:45:51.6488790Z 2025-05-07T19:45:51.6488793Z 2025-05-07T19:45:51.6488796Z 2025-05-07T19:45:51.6488800Z 2025-05-07T19:45:51.6488803Z 2025-05-07T19:45:51.6488810Z 2025-05-07T19:45:51.6488814Z 2025-05-07T19:45:51.6488817Z 2025-05-07T19:45:51.6489140Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:45:51.6489471Z 2025-05-07T19:45:51.6489576Z 2025-05-07T19:45:51.6489580Z 2025-05-07T19:45:51.6489583Z 2025-05-07T19:45:51.6489586Z 2025-05-07T19:45:51.6489590Z 2025-05-07T19:45:51.6489593Z 2025-05-07T19:45:51.6489596Z 2025-05-07T19:45:51.6489600Z 2025-05-07T19:45:51.6489603Z 2025-05-07T19:45:51.6489636Z 2025-05-07T19:45:51.6489639Z 2025-05-07T19:45:51.6489642Z 2025-05-07T19:45:51.6489646Z 2025-05-07T19:45:51.6490052Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:45:51.6490410Z 2025-05-07T19:45:51.6490413Z 2025-05-07T19:45:51.6490417Z 2025-05-07T19:45:51.6490446Z 2025-05-07T19:45:51.6490449Z 2025-05-07T19:45:51.6490453Z 2025-05-07T19:45:51.6490456Z 2025-05-07T19:45:51.6490459Z 2025-05-07T19:45:51.6490463Z 2025-05-07T19:45:51.6490470Z 2025-05-07T19:45:51.6490474Z 2025-05-07T19:45:51.6490477Z 2025-05-07T19:45:51.6490481Z 2025-05-07T19:45:51.6490484Z 2025-05-07T19:45:51.6490488Z 2025-05-07T19:45:51.6491214Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:45:51.6491585Z 2025-05-07T19:45:51.6491597Z 2025-05-07T19:45:51.6491601Z 2025-05-07T19:45:51.6491604Z 2025-05-07T19:45:51.6491608Z 2025-05-07T19:45:51.6491611Z 2025-05-07T19:45:51.6491615Z 2025-05-07T19:45:51.6491618Z 2025-05-07T19:45:51.6491622Z 2025-05-07T19:45:51.6491625Z 2025-05-07T19:45:51.6491629Z 2025-05-07T19:45:51.6491632Z 2025-05-07T19:45:51.6491635Z 2025-05-07T19:45:51.6491639Z 2025-05-07T19:45:51.6491642Z 2025-05-07T19:45:51.6491646Z 2025-05-07T19:45:51.6492266Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:45:51.6492623Z 2025-05-07T19:45:51.6492639Z 2025-05-07T19:45:51.6492643Z 2025-05-07T19:45:51.6492646Z 2025-05-07T19:45:51.6492654Z 2025-05-07T19:45:51.6492658Z 2025-05-07T19:45:51.6492661Z 2025-05-07T19:45:51.6492664Z 2025-05-07T19:45:51.6492668Z 2025-05-07T19:45:51.6492671Z 2025-05-07T19:45:51.6492696Z 2025-05-07T19:45:51.6492699Z 2025-05-07T19:45:51.6492706Z 2025-05-07T19:45:51.6492709Z 2025-05-07T19:45:51.6492713Z 2025-05-07T19:45:51.6492716Z 2025-05-07T19:45:51.6492719Z 2025-05-07T19:45:51.6493338Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:45:51.6493685Z 2025-05-07T19:45:51.6493689Z 2025-05-07T19:45:51.6493731Z 2025-05-07T19:45:51.6493735Z 2025-05-07T19:45:51.6493738Z 2025-05-07T19:45:51.6493742Z 2025-05-07T19:45:51.6493745Z 2025-05-07T19:45:51.6493749Z 2025-05-07T19:45:51.6493752Z 2025-05-07T19:45:51.6493756Z 2025-05-07T19:45:51.6493759Z 2025-05-07T19:45:51.6493762Z 2025-05-07T19:45:51.6493766Z 2025-05-07T19:45:51.6493769Z 2025-05-07T19:45:51.6493772Z 2025-05-07T19:45:51.6493776Z 2025-05-07T19:45:51.6493858Z 2025-05-07T19:45:51.6493862Z 2025-05-07T19:45:51.6494344Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:45:51.6494696Z 2025-05-07T19:45:51.6494712Z 2025-05-07T19:45:51.6494719Z 2025-05-07T19:45:51.6494723Z 2025-05-07T19:45:51.6494726Z 2025-05-07T19:45:51.6494730Z 2025-05-07T19:45:51.6494734Z 2025-05-07T19:45:51.6494737Z 2025-05-07T19:45:51.6494740Z 2025-05-07T19:45:51.6494744Z 2025-05-07T19:45:51.6494747Z 2025-05-07T19:45:51.6494750Z 2025-05-07T19:45:51.6494779Z 2025-05-07T19:45:51.6494783Z 2025-05-07T19:45:51.6494786Z 2025-05-07T19:45:51.6494790Z 2025-05-07T19:45:51.6494793Z 2025-05-07T19:45:51.6494796Z 2025-05-07T19:45:51.6494800Z 2025-05-07T19:45:51.7434549Z ... (more hidden) ... 2025-05-07T19:45:51.7435965Z nsight-compute-2024. | 443.1 MB | | 1% 2025-05-07T19:45:51.7436738Z 2025-05-07T19:45:51.7440999Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:45:51.7441819Z 2025-05-07T19:45:51.7441831Z 2025-05-07T19:45:51.7446200Z libcufft-11.3.0.4 | 156.2 MB | | 1%  2025-05-07T19:45:51.7447001Z 2025-05-07T19:45:51.7447014Z 2025-05-07T19:45:51.7447499Z 2025-05-07T19:45:51.7470161Z libcusparse-12.5.4.2 | 118.6 MB | 2 | 2%  2025-05-07T19:45:51.7471067Z 2025-05-07T19:45:51.7471080Z 2025-05-07T19:45:51.7471090Z 2025-05-07T19:45:51.7471101Z 2025-05-07T19:45:51.8438757Z cuda-nsight-12.6.77 | 113.2 MB | | 1%  2025-05-07T19:45:51.8448858Z nsight-compute-2024. | 443.1 MB | 3 | 4% 2025-05-07T19:45:51.8449193Z 2025-05-07T19:45:51.8449208Z 2025-05-07T19:45:51.8470865Z libcufft-11.3.0.4 | 156.2 MB | 5 | 5%  2025-05-07T19:45:51.8471723Z 2025-05-07T19:45:51.8471737Z 2025-05-07T19:45:51.8471748Z 2025-05-07T19:45:51.8471758Z 2025-05-07T19:45:51.8486931Z cuda-nsight-12.6.77 | 113.2 MB | 8 | 8%  2025-05-07T19:45:51.8487263Z 2025-05-07T19:45:51.8514510Z libcublas-12.6.4.1 | 256.2 MB | 2 | 3%  2025-05-07T19:45:51.8515374Z 2025-05-07T19:45:51.8515404Z 2025-05-07T19:45:51.8515416Z 2025-05-07T19:45:51.9438002Z libcusparse-12.5.4.2 | 118.6 MB | 6 | 7%  2025-05-07T19:45:51.9469869Z nsight-compute-2024. | 443.1 MB | 5 | 6% 2025-05-07T19:45:51.9470301Z 2025-05-07T19:45:51.9470395Z 2025-05-07T19:45:51.9470400Z 2025-05-07T19:45:51.9470426Z 2025-05-07T19:45:51.9489891Z cuda-nsight-12.6.77 | 113.2 MB | #8 | 18%  2025-05-07T19:45:51.9490801Z 2025-05-07T19:45:51.9516476Z libcublas-12.6.4.1 | 256.2 MB | 5 | 5%  2025-05-07T19:45:51.9516798Z 2025-05-07T19:45:51.9516802Z 2025-05-07T19:45:51.9516921Z 2025-05-07T19:45:51.9584925Z libcusparse-12.5.4.2 | 118.6 MB | # | 10%  2025-05-07T19:45:51.9585872Z 2025-05-07T19:45:51.9585886Z 2025-05-07T19:45:52.0517110Z libcufft-11.3.0.4 | 156.2 MB | 8 | 8%  2025-05-07T19:45:52.0517415Z 2025-05-07T19:45:52.0517420Z 2025-05-07T19:45:52.0517543Z 2025-05-07T19:45:52.0586234Z libcusparse-12.5.4.2 | 118.6 MB | #5 | 15%  2025-05-07T19:45:52.0587175Z 2025-05-07T19:45:52.0587221Z 2025-05-07T19:45:52.0681259Z libcufft-11.3.0.4 | 156.2 MB | #2 | 12%  2025-05-07T19:45:52.0681562Z 2025-05-07T19:45:52.0681567Z 2025-05-07T19:45:52.0681593Z 2025-05-07T19:45:52.0681596Z 2025-05-07T19:45:52.0776437Z cuda-nsight-12.6.77 | 113.2 MB | ##5 | 25%  2025-05-07T19:45:52.0928702Z nsight-compute-2024. | 443.1 MB | 7 | 8% 2025-05-07T19:45:52.0929223Z 2025-05-07T19:45:52.1528027Z libcublas-12.6.4.1 | 256.2 MB | 7 | 7%  2025-05-07T19:45:52.1528329Z 2025-05-07T19:45:52.1528334Z 2025-05-07T19:45:52.1528338Z 2025-05-07T19:45:52.1587296Z libcusparse-12.5.4.2 | 118.6 MB | #9 | 19%  2025-05-07T19:45:52.1587620Z 2025-05-07T19:45:52.1587625Z 2025-05-07T19:45:52.1930146Z libcufft-11.3.0.4 | 156.2 MB | #6 | 16%  2025-05-07T19:45:52.1930482Z 2025-05-07T19:45:52.2006659Z libcublas-12.6.4.1 | 256.2 MB | 9 | 10%  2025-05-07T19:45:52.2007501Z 2025-05-07T19:45:52.2007515Z 2025-05-07T19:45:52.2007557Z 2025-05-07T19:45:52.2007567Z 2025-05-07T19:45:52.2032486Z cuda-nsight-12.6.77 | 113.2 MB | ###2 | 32%  2025-05-07T19:45:52.2527487Z nsight-compute-2024. | 443.1 MB | 9 | 10% 2025-05-07T19:45:52.2528312Z 2025-05-07T19:45:52.2528325Z 2025-05-07T19:45:52.2528336Z 2025-05-07T19:45:52.2846712Z libcusparse-12.5.4.2 | 118.6 MB | ##4 | 24%  2025-05-07T19:45:52.2847039Z 2025-05-07T19:45:52.2847150Z 2025-05-07T19:45:52.2931054Z libcufft-11.3.0.4 | 156.2 MB | ## | 20%  2025-05-07T19:45:52.2931382Z 2025-05-07T19:45:52.3070364Z libcublas-12.6.4.1 | 256.2 MB | #2 | 13%  2025-05-07T19:45:52.3070698Z 2025-05-07T19:45:52.3070704Z 2025-05-07T19:45:52.3070727Z 2025-05-07T19:45:52.3070748Z 2025-05-07T19:45:52.3153281Z cuda-nsight-12.6.77 | 113.2 MB | ###8 | 38%  2025-05-07T19:45:52.3528061Z nsight-compute-2024. | 443.1 MB | #1 | 11% 2025-05-07T19:45:52.3528720Z 2025-05-07T19:45:52.3529094Z 2025-05-07T19:45:52.3529100Z 2025-05-07T19:45:52.4070818Z libcusparse-12.5.4.2 | 118.6 MB | ##9 | 29%  2025-05-07T19:45:52.4071165Z 2025-05-07T19:45:52.4071172Z 2025-05-07T19:45:52.4071175Z 2025-05-07T19:45:52.4071179Z 2025-05-07T19:45:52.4155220Z cuda-nsight-12.6.77 | 113.2 MB | ####5 | 45%  2025-05-07T19:45:52.4287578Z nsight-compute-2024. | 443.1 MB | #3 | 13% 2025-05-07T19:45:52.4287890Z 2025-05-07T19:45:52.4287918Z 2025-05-07T19:45:52.4528569Z libcufft-11.3.0.4 | 156.2 MB | ##3 | 23%  2025-05-07T19:45:52.4528885Z 2025-05-07T19:45:52.4528992Z 2025-05-07T19:45:52.4529001Z 2025-05-07T19:45:52.4809097Z libcusparse-12.5.4.2 | 118.6 MB | ###4 | 35%  2025-05-07T19:45:52.4810056Z 2025-05-07T19:45:52.5117831Z libcublas-12.6.4.1 | 256.2 MB | #4 | 15%  2025-05-07T19:45:52.5118125Z 2025-05-07T19:45:52.5118132Z 2025-05-07T19:45:52.5118136Z 2025-05-07T19:45:52.5118141Z 2025-05-07T19:45:52.5239672Z cuda-nsight-12.6.77 | 113.2 MB | #####1 | 52%  2025-05-07T19:45:52.5286668Z nsight-compute-2024. | 443.1 MB | #4 | 15% 2025-05-07T19:45:52.5287511Z 2025-05-07T19:45:52.5287526Z 2025-05-07T19:45:52.5648900Z libcufft-11.3.0.4 | 156.2 MB | ##7 | 28%  2025-05-07T19:45:52.5649342Z 2025-05-07T19:45:52.5649369Z 2025-05-07T19:45:52.5649374Z 2025-05-07T19:45:52.5809477Z libcusparse-12.5.4.2 | 118.6 MB | ###9 | 40%  2025-05-07T19:45:52.5809801Z 2025-05-07T19:45:52.6166990Z libcublas-12.6.4.1 | 256.2 MB | #7 | 17%  2025-05-07T19:45:52.6167293Z 2025-05-07T19:45:52.6167372Z 2025-05-07T19:45:52.6167377Z 2025-05-07T19:45:52.6167461Z 2025-05-07T19:45:52.6290868Z cuda-nsight-12.6.77 | 113.2 MB | #####7 | 58%  2025-05-07T19:45:52.6291812Z 2025-05-07T19:45:52.6291826Z 2025-05-07T19:45:52.6381907Z libcufft-11.3.0.4 | 156.2 MB | ###1 | 32%  2025-05-07T19:45:52.6811666Z nsight-compute-2024. | 443.1 MB | #6 | 17% 2025-05-07T19:45:52.6812517Z 2025-05-07T19:45:52.6852948Z libcublas-12.6.4.1 | 256.2 MB | #9 | 20%  2025-05-07T19:45:52.6853273Z 2025-05-07T19:45:52.6853278Z 2025-05-07T19:45:52.6853282Z 2025-05-07T19:45:52.7277844Z libcusparse-12.5.4.2 | 118.6 MB | ####4 | 44%  2025-05-07T19:45:52.7278757Z 2025-05-07T19:45:52.7278770Z 2025-05-07T19:45:52.7278805Z 2025-05-07T19:45:52.7278816Z 2025-05-07T19:45:52.7287182Z cuda-nsight-12.6.77 | 113.2 MB | ######3 | 64%  2025-05-07T19:45:52.7287481Z 2025-05-07T19:45:52.7287492Z 2025-05-07T19:45:52.7406530Z libcufft-11.3.0.4 | 156.2 MB | ###5 | 36%  2025-05-07T19:45:52.7812396Z nsight-compute-2024. | 443.1 MB | #8 | 18% 2025-05-07T19:45:52.7813656Z 2025-05-07T19:45:52.7936841Z libcublas-12.6.4.1 | 256.2 MB | ##2 | 22%  2025-05-07T19:45:52.7937685Z 2025-05-07T19:45:52.7937698Z 2025-05-07T19:45:52.7937853Z 2025-05-07T19:45:52.8293592Z libcusparse-12.5.4.2 | 118.6 MB | ####8 | 49%  2025-05-07T19:45:52.8294526Z 2025-05-07T19:45:52.8294540Z 2025-05-07T19:45:52.8408174Z libcufft-11.3.0.4 | 156.2 MB | ###9 | 39%  2025-05-07T19:45:52.8408664Z 2025-05-07T19:45:52.8408710Z 2025-05-07T19:45:52.8408716Z 2025-05-07T19:45:52.8408719Z 2025-05-07T19:45:52.8570214Z cuda-nsight-12.6.77 | 113.2 MB | ######9 | 70%  2025-05-07T19:45:52.8817081Z nsight-compute-2024. | 443.1 MB | #9 | 20% 2025-05-07T19:45:52.8817408Z 2025-05-07T19:45:52.9024770Z libcublas-12.6.4.1 | 256.2 MB | ##4 | 24%  2025-05-07T19:45:52.9025054Z 2025-05-07T19:45:52.9025058Z 2025-05-07T19:45:52.9025070Z 2025-05-07T19:45:52.9308226Z libcusparse-12.5.4.2 | 118.6 MB | #####3 | 53%  2025-05-07T19:45:52.9309133Z 2025-05-07T19:45:52.9309147Z 2025-05-07T19:45:52.9433247Z libcufft-11.3.0.4 | 156.2 MB | ####2 | 43%  2025-05-07T19:45:52.9434141Z 2025-05-07T19:45:52.9434155Z 2025-05-07T19:45:52.9434594Z 2025-05-07T19:45:52.9434605Z 2025-05-07T19:45:52.9692949Z cuda-nsight-12.6.77 | 113.2 MB | #######5 | 76%  2025-05-07T19:45:52.9821436Z nsight-compute-2024. | 443.1 MB | ##1 | 21% 2025-05-07T19:45:52.9821744Z 2025-05-07T19:45:53.0089010Z libcublas-12.6.4.1 | 256.2 MB | ##6 | 27%  2025-05-07T19:45:53.0089883Z 2025-05-07T19:45:53.0089896Z 2025-05-07T19:45:53.0089907Z 2025-05-07T19:45:53.0310891Z libcusparse-12.5.4.2 | 118.6 MB | #####7 | 57%  2025-05-07T19:45:53.0311838Z 2025-05-07T19:45:53.0311851Z 2025-05-07T19:45:53.0465684Z libcufft-11.3.0.4 | 156.2 MB | ####6 | 47%  2025-05-07T19:45:53.0466545Z 2025-05-07T19:45:53.0466559Z 2025-05-07T19:45:53.0466571Z 2025-05-07T19:45:53.0466581Z 2025-05-07T19:45:53.0733979Z cuda-nsight-12.6.77 | 113.2 MB | ########1 | 81%  2025-05-07T19:45:53.0822068Z nsight-compute-2024. | 443.1 MB | ##2 | 23% 2025-05-07T19:45:53.0822600Z 2025-05-07T19:45:53.1133982Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 29%  2025-05-07T19:45:53.1134365Z 2025-05-07T19:45:53.1134369Z 2025-05-07T19:45:53.1134380Z 2025-05-07T19:45:53.1310004Z libcusparse-12.5.4.2 | 118.6 MB | ######1 | 61%  2025-05-07T19:45:53.1310365Z 2025-05-07T19:45:53.1310532Z 2025-05-07T19:45:53.1487352Z libcufft-11.3.0.4 | 156.2 MB | #####1 | 51%  2025-05-07T19:45:53.1487689Z 2025-05-07T19:45:53.1487822Z 2025-05-07T19:45:53.1758871Z 2025-05-07T19:45:53.1758878Z 2025-05-07T19:45:53.1759302Z cuda-nsight-12.6.77 | 113.2 MB | ########7 | 87%  2025-05-07T19:45:53.1823894Z nsight-compute-2024. | 443.1 MB | ##4 | 24% 2025-05-07T19:45:53.1824715Z 2025-05-07T19:45:53.2239418Z libcublas-12.6.4.1 | 256.2 MB | ###1 | 32%  2025-05-07T19:45:53.2239750Z 2025-05-07T19:45:53.2239754Z 2025-05-07T19:45:53.2239757Z 2025-05-07T19:45:53.2311357Z libcusparse-12.5.4.2 | 118.6 MB | ######5 | 66%  2025-05-07T19:45:53.2311681Z 2025-05-07T19:45:53.2311701Z 2025-05-07T19:45:53.2493226Z libcufft-11.3.0.4 | 156.2 MB | #####5 | 55%  2025-05-07T19:45:53.2494084Z 2025-05-07T19:45:53.2494097Z 2025-05-07T19:45:53.2494107Z 2025-05-07T19:45:53.2494118Z 2025-05-07T19:45:53.2820978Z cuda-nsight-12.6.77 | 113.2 MB | #########2 | 93%  2025-05-07T19:45:53.2822326Z nsight-compute-2024. | 443.1 MB | ##5 | 26% 2025-05-07T19:45:53.2823125Z 2025-05-07T19:45:53.3240831Z libcublas-12.6.4.1 | 256.2 MB | ###4 | 34%  2025-05-07T19:45:53.3241670Z 2025-05-07T19:45:53.3241684Z 2025-05-07T19:45:53.3241695Z 2025-05-07T19:45:53.3494581Z libcusparse-12.5.4.2 | 118.6 MB | ####### | 70%  2025-05-07T19:45:53.3495489Z 2025-05-07T19:45:53.3495502Z 2025-05-07T19:45:53.3495923Z 2025-05-07T19:45:53.3495936Z 2025-05-07T19:45:53.3821949Z cuda-nsight-12.6.77 | 113.2 MB | #########8 | 99%  2025-05-07T19:45:53.3822914Z nsight-compute-2024. | 443.1 MB | ##7 | 27% 2025-05-07T19:45:53.3823296Z 2025-05-07T19:45:53.4161205Z libcublas-12.6.4.1 | 256.2 MB | ###7 | 37%  2025-05-07T19:45:53.4161558Z 2025-05-07T19:45:53.4161563Z 2025-05-07T19:45:53.4239807Z libcufft-11.3.0.4 | 156.2 MB | #####8 | 59%  2025-05-07T19:45:53.4240125Z 2025-05-07T19:45:53.4240153Z 2025-05-07T19:45:53.4240156Z 2025-05-07T19:45:53.4831208Z libcusparse-12.5.4.2 | 118.6 MB | #######5 | 76%  2025-05-07T19:45:53.4831538Z 2025-05-07T19:45:53.4984942Z libcublas-12.6.4.1 | 256.2 MB | #### | 41%  2025-05-07T19:45:53.5240335Z nsight-compute-2024. | 443.1 MB | ##8 | 29% 2025-05-07T19:45:53.5240676Z 2025-05-07T19:45:53.5240681Z 2025-05-07T19:45:53.5240685Z 2025-05-07T19:45:53.5344620Z libcusparse-12.5.4.2 | 118.6 MB | ########2 | 82%  2025-05-07T19:45:53.5344945Z 2025-05-07T19:45:53.5344949Z 2025-05-07T19:45:53.5831207Z libcufft-11.3.0.4 | 156.2 MB | ######2 | 62%  2025-05-07T19:45:53.5831512Z 2025-05-07T19:45:53.6132753Z libcublas-12.6.4.1 | 256.2 MB | ####3 | 44%  2025-05-07T19:45:53.6240713Z nsight-compute-2024. | 443.1 MB | ### | 30% 2025-05-07T19:45:53.6241006Z 2025-05-07T19:45:53.6241011Z 2025-05-07T19:45:53.6241015Z 2025-05-07T19:45:53.6698414Z libcusparse-12.5.4.2 | 118.6 MB | ########8 | 89%  2025-05-07T19:45:53.6698734Z 2025-05-07T19:45:53.6698739Z 2025-05-07T19:45:53.6831417Z libcufft-11.3.0.4 | 156.2 MB | ######5 | 66%  2025-05-07T19:45:53.6831751Z 2025-05-07T19:45:53.7132488Z libcublas-12.6.4.1 | 256.2 MB | ####7 | 47%  2025-05-07T19:45:53.7245742Z nsight-compute-2024. | 443.1 MB | ###1 | 32% 2025-05-07T19:45:53.7246223Z 2025-05-07T19:45:53.7246267Z 2025-05-07T19:45:53.7246273Z 2025-05-07T19:45:53.7967341Z libcusparse-12.5.4.2 | 118.6 MB | #########5 | 96%  2025-05-07T19:45:53.7967662Z 2025-05-07T19:45:53.8234187Z libcublas-12.6.4.1 | 256.2 MB | ##### | 50%  2025-05-07T19:45:53.8939826Z nsight-compute-2024. | 443.1 MB | ###4 | 34% 2025-05-07T19:45:53.8940336Z 2025-05-07T19:45:53.8940374Z 2025-05-07T19:45:53.8966981Z libcufft-11.3.0.4 | 156.2 MB | ######8 | 68%  2025-05-07T19:45:53.8967288Z 2025-05-07T19:45:53.9343645Z libcublas-12.6.4.1 | 256.2 MB | #####3 | 53%  2025-05-07T19:45:53.9940987Z nsight-compute-2024. | 443.1 MB | ###5 | 36% 2025-05-07T19:45:53.9941539Z 2025-05-07T19:45:53.9941632Z 2025-05-07T19:45:54.0201051Z libcufft-11.3.0.4 | 156.2 MB | #######3 | 73%  2025-05-07T19:45:54.0201357Z 2025-05-07T19:45:54.0343022Z libcublas-12.6.4.1 | 256.2 MB | #####6 | 56%  2025-05-07T19:45:54.0942359Z nsight-compute-2024. | 443.1 MB | ###8 | 39% 2025-05-07T19:45:54.0942687Z 2025-05-07T19:45:54.0942692Z 2025-05-07T19:45:54.1202227Z libcufft-11.3.0.4 | 156.2 MB | #######6 | 77%  2025-05-07T19:45:54.1203084Z 2025-05-07T19:45:54.1344276Z libcublas-12.6.4.1 | 256.2 MB | ######1 | 61%  2025-05-07T19:45:54.1942741Z nsight-compute-2024. | 443.1 MB | ####1 | 41% 2025-05-07T19:45:54.1943606Z 2025-05-07T19:45:54.1943620Z 2025-05-07T19:45:54.2072507Z libcufft-11.3.0.4 | 156.2 MB | ########2 | 82%  2025-05-07T19:45:54.2073373Z 2025-05-07T19:45:54.2073388Z 2025-05-07T19:45:54.2073398Z 2025-05-07T19:45:54.2073408Z 2025-05-07T19:45:54.2345206Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:54.2558557Z nsight-compute-2024. | 443.1 MB | ####4 | 44% 2025-05-07T19:45:54.2559416Z 2025-05-07T19:45:54.2559429Z 2025-05-07T19:45:54.2559441Z 2025-05-07T19:45:54.2559451Z 2025-05-07T19:45:54.2559461Z 2025-05-07T19:45:54.3021772Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:54.3022103Z 2025-05-07T19:45:54.3022358Z 2025-05-07T19:45:54.3135507Z libcufft-11.3.0.4 | 156.2 MB | ########7 | 88%  2025-05-07T19:45:54.3135817Z 2025-05-07T19:45:54.3346248Z libcublas-12.6.4.1 | 256.2 MB | ######4 | 65%  2025-05-07T19:45:54.3558749Z nsight-compute-2024. | 443.1 MB | ####7 | 47% 2025-05-07T19:45:54.3559612Z 2025-05-07T19:45:54.3559626Z 2025-05-07T19:45:54.3559794Z 2025-05-07T19:45:54.3559797Z 2025-05-07T19:45:54.3559801Z 2025-05-07T19:45:54.4354782Z cuda-nvvp-12.6.80 | 109.3 MB | 7 | 8%  2025-05-07T19:45:54.4355134Z 2025-05-07T19:45:54.4440379Z libcublas-12.6.4.1 | 256.2 MB | ######7 | 68%  2025-05-07T19:45:54.4597772Z nsight-compute-2024. | 443.1 MB | ####9 | 50% 2025-05-07T19:45:54.5354356Z 2025-05-07T19:45:54.5354363Z 2025-05-07T19:45:54.5354368Z 2025-05-07T19:45:54.5354372Z 2025-05-07T19:45:54.5354377Z 2025-05-07T19:45:54.5354824Z cuda-nvvp-12.6.80 | 109.3 MB | #5 | 15%  2025-05-07T19:45:54.5355148Z 2025-05-07T19:45:54.5444659Z libcublas-12.6.4.1 | 256.2 MB | ####### | 71%  2025-05-07T19:45:54.5597812Z nsight-compute-2024. | 443.1 MB | #####2 | 52% 2025-05-07T19:45:54.5598115Z 2025-05-07T19:45:54.5598120Z 2025-05-07T19:45:54.5598327Z 2025-05-07T19:45:54.5598330Z 2025-05-07T19:45:54.5598334Z 2025-05-07T19:45:54.6357774Z cuda-nvvp-12.6.80 | 109.3 MB | ##4 | 25%  2025-05-07T19:45:54.6358090Z 2025-05-07T19:45:54.6358104Z 2025-05-07T19:45:54.6447390Z libcufft-11.3.0.4 | 156.2 MB | #########1 | 92%  2025-05-07T19:45:54.6598570Z nsight-compute-2024. | 443.1 MB | #####4 | 55% 2025-05-07T19:45:54.6598890Z 2025-05-07T19:45:54.6598895Z 2025-05-07T19:45:54.6598923Z 2025-05-07T19:45:54.6598927Z 2025-05-07T19:45:54.6598931Z 2025-05-07T19:45:54.6679308Z cuda-nvvp-12.6.80 | 109.3 MB | ###3 | 33%  2025-05-07T19:45:54.6679641Z 2025-05-07T19:45:54.6679646Z 2025-05-07T19:45:54.6680927Z 2025-05-07T19:45:54.6719460Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:54.6719778Z 2025-05-07T19:45:54.7211231Z libcublas-12.6.4.1 | 256.2 MB | #######3 | 74%  2025-05-07T19:45:54.7211535Z 2025-05-07T19:45:54.7211662Z 2025-05-07T19:45:54.7211688Z 2025-05-07T19:45:54.7211693Z 2025-05-07T19:45:54.7211734Z 2025-05-07T19:45:54.7211739Z 2025-05-07T19:45:54.7362886Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:54.7363232Z 2025-05-07T19:45:54.7363236Z 2025-05-07T19:45:54.7707826Z libcufft-11.3.0.4 | 156.2 MB | #########5 | 96%  2025-05-07T19:45:54.7708141Z 2025-05-07T19:45:54.7708247Z 2025-05-07T19:45:54.7708256Z 2025-05-07T19:45:54.7708262Z 2025-05-07T19:45:54.7708268Z 2025-05-07T19:45:54.7836435Z cuda-nvvp-12.6.80 | 109.3 MB | #### | 41%  2025-05-07T19:45:54.8211518Z nsight-compute-2024. | 443.1 MB | #####7 | 57% 2025-05-07T19:45:54.8212031Z 2025-05-07T19:45:54.8212079Z 2025-05-07T19:45:54.8212086Z 2025-05-07T19:45:54.8212153Z 2025-05-07T19:45:54.8212157Z 2025-05-07T19:45:54.8212169Z 2025-05-07T19:45:54.8217175Z libcusolver-11.7.1.2 | 95.8 MB | 5 | 5%  2025-05-07T19:45:54.8217545Z 2025-05-07T19:45:54.8788398Z libcublas-12.6.4.1 | 256.2 MB | #######6 | 76%  2025-05-07T19:45:54.8789446Z 2025-05-07T19:45:54.8789471Z 2025-05-07T19:45:54.9112677Z libcufft-11.3.0.4 | 156.2 MB | #########9 | 99%  2025-05-07T19:45:54.9113543Z 2025-05-07T19:45:54.9113557Z 2025-05-07T19:45:54.9113568Z 2025-05-07T19:45:54.9113579Z 2025-05-07T19:45:54.9113590Z 2025-05-07T19:45:54.9213275Z cuda-nvvp-12.6.80 | 109.3 MB | ####8 | 48%  2025-05-07T19:45:54.9213605Z 2025-05-07T19:45:54.9213610Z 2025-05-07T19:45:54.9213614Z 2025-05-07T19:45:54.9213617Z 2025-05-07T19:45:54.9213621Z 2025-05-07T19:45:54.9213624Z 2025-05-07T19:45:54.9327472Z libcusolver-11.7.1.2 | 95.8 MB | # | 10%  2025-05-07T19:45:54.9327837Z 2025-05-07T19:45:54.9583269Z libcublas-12.6.4.1 | 256.2 MB | #######8 | 79%  2025-05-07T19:45:55.0213653Z nsight-compute-2024. | 443.1 MB | #####9 | 60% 2025-05-07T19:45:55.0214197Z 2025-05-07T19:45:55.0214218Z 2025-05-07T19:45:55.0214222Z 2025-05-07T19:45:55.0214251Z 2025-05-07T19:45:55.0214256Z 2025-05-07T19:45:55.0214262Z 2025-05-07T19:45:55.0228756Z libcusolver-11.7.1.2 | 95.8 MB | #4 | 15%  2025-05-07T19:45:55.0229106Z 2025-05-07T19:45:55.0229136Z 2025-05-07T19:45:55.0229140Z 2025-05-07T19:45:55.0229144Z 2025-05-07T19:45:55.0229147Z 2025-05-07T19:45:55.0330578Z cuda-nvvp-12.6.80 | 109.3 MB | #####4 | 55%  2025-05-07T19:45:55.0331498Z 2025-05-07T19:45:55.0909955Z libcublas-12.6.4.1 | 256.2 MB | ########1 | 81%  2025-05-07T19:45:55.1215836Z nsight-compute-2024. | 443.1 MB | ######1 | 62% 2025-05-07T19:45:55.1216370Z 2025-05-07T19:45:55.1216414Z 2025-05-07T19:45:55.1216422Z 2025-05-07T19:45:55.1216488Z 2025-05-07T19:45:55.1216511Z 2025-05-07T19:45:55.1216524Z 2025-05-07T19:45:55.1337560Z libcusolver-11.7.1.2 | 95.8 MB | #9 | 20%  2025-05-07T19:45:55.1338115Z 2025-05-07T19:45:55.1357389Z libcublas-12.6.4.1 | 256.2 MB | ########3 | 84%  2025-05-07T19:45:55.1358172Z 2025-05-07T19:45:55.1358184Z 2025-05-07T19:45:55.1358188Z 2025-05-07T19:45:55.1358192Z 2025-05-07T19:45:55.1358195Z 2025-05-07T19:45:55.2217538Z cuda-nvvp-12.6.80 | 109.3 MB | ######1 | 62%  2025-05-07T19:45:55.2217872Z 2025-05-07T19:45:55.2217876Z 2025-05-07T19:45:55.2217880Z 2025-05-07T19:45:55.2217883Z 2025-05-07T19:45:55.2217887Z 2025-05-07T19:45:55.2217891Z 2025-05-07T19:45:55.2230959Z libcusolver-11.7.1.2 | 95.8 MB | ##5 | 25%  2025-05-07T19:45:55.2453776Z nsight-compute-2024. | 443.1 MB | ######3 | 64% 2025-05-07T19:45:55.2454111Z 2025-05-07T19:45:55.2454268Z 2025-05-07T19:45:55.2454277Z 2025-05-07T19:45:55.2454283Z 2025-05-07T19:45:55.2454287Z 2025-05-07T19:45:55.3217784Z cuda-nvvp-12.6.80 | 109.3 MB | ######7 | 68%  2025-05-07T19:45:55.3218142Z 2025-05-07T19:45:55.3218147Z 2025-05-07T19:45:55.3218151Z 2025-05-07T19:45:55.3218154Z 2025-05-07T19:45:55.3218159Z 2025-05-07T19:45:55.3218178Z 2025-05-07T19:45:55.3224758Z libcusolver-11.7.1.2 | 95.8 MB | ###1 | 32%  2025-05-07T19:45:55.3225071Z 2025-05-07T19:45:55.3225102Z 2025-05-07T19:45:55.3225106Z 2025-05-07T19:45:55.3226221Z 2025-05-07T19:45:55.3263443Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:55.3263772Z 2025-05-07T19:45:55.3268403Z libcublas-12.6.4.1 | 256.2 MB | ########6 | 86%  2025-05-07T19:45:55.3454644Z nsight-compute-2024. | 443.1 MB | ######5 | 65% 2025-05-07T19:45:55.3455505Z 2025-05-07T19:45:55.3455519Z 2025-05-07T19:45:55.3455530Z 2025-05-07T19:45:55.3455541Z 2025-05-07T19:45:55.3455551Z 2025-05-07T19:45:55.4272782Z cuda-nvvp-12.6.80 | 109.3 MB | #######4 | 75%  2025-05-07T19:45:55.4489373Z nsight-compute-2024. | 443.1 MB | ######8 | 69% 2025-05-07T19:45:55.4489766Z 2025-05-07T19:45:55.4571881Z libcublas-12.6.4.1 | 256.2 MB | ########8 | 88%  2025-05-07T19:45:55.4572190Z 2025-05-07T19:45:55.4572210Z 2025-05-07T19:45:55.4572214Z 2025-05-07T19:45:55.4572218Z 2025-05-07T19:45:55.4572221Z 2025-05-07T19:45:55.4572225Z 2025-05-07T19:45:55.5034731Z libcusolver-11.7.1.2 | 95.8 MB | ###7 | 37%  2025-05-07T19:45:55.5035079Z 2025-05-07T19:45:55.5035083Z 2025-05-07T19:45:55.5035087Z 2025-05-07T19:45:55.5035090Z 2025-05-07T19:45:55.5035094Z 2025-05-07T19:45:55.5355417Z cuda-nvvp-12.6.80 | 109.3 MB | ########1 | 81%  2025-05-07T19:45:55.5572344Z nsight-compute-2024. | 443.1 MB | ####### | 71% 2025-05-07T19:45:55.5572852Z 2025-05-07T19:45:55.5572892Z 2025-05-07T19:45:55.5572898Z 2025-05-07T19:45:55.5572961Z 2025-05-07T19:45:55.5572966Z 2025-05-07T19:45:55.5573017Z 2025-05-07T19:45:55.5760234Z libcusolver-11.7.1.2 | 95.8 MB | ####5 | 45%  2025-05-07T19:45:55.5761253Z 2025-05-07T19:45:55.6391855Z libcublas-12.6.4.1 | 256.2 MB | ######### | 90%  2025-05-07T19:45:55.6573879Z nsight-compute-2024. | 443.1 MB | #######3 | 73% 2025-05-07T19:45:55.6574366Z 2025-05-07T19:45:55.6574422Z 2025-05-07T19:45:55.6574428Z 2025-05-07T19:45:55.6574432Z 2025-05-07T19:45:55.6574436Z 2025-05-07T19:45:55.6574439Z 2025-05-07T19:45:55.6760933Z libcusolver-11.7.1.2 | 95.8 MB | #####3 | 54%  2025-05-07T19:45:55.6761328Z 2025-05-07T19:45:55.7570371Z libcublas-12.6.4.1 | 256.2 MB | #########2 | 93%  2025-05-07T19:45:55.7587238Z nsight-compute-2024. | 443.1 MB | #######5 | 75% 2025-05-07T19:45:55.7587518Z 2025-05-07T19:45:55.7587523Z 2025-05-07T19:45:55.7587536Z 2025-05-07T19:45:55.7587540Z 2025-05-07T19:45:55.7587543Z 2025-05-07T19:45:55.7599300Z cuda-nvvp-12.6.80 | 109.3 MB | ########6 | 87%  2025-05-07T19:45:55.7599686Z 2025-05-07T19:45:55.7599693Z 2025-05-07T19:45:55.7599698Z 2025-05-07T19:45:55.7599705Z 2025-05-07T19:45:55.7599709Z 2025-05-07T19:45:55.7599727Z 2025-05-07T19:45:55.7767010Z libcusolver-11.7.1.2 | 95.8 MB | ###### | 60%  2025-05-07T19:45:55.7767672Z 2025-05-07T19:45:55.8587764Z libcublas-12.6.4.1 | 256.2 MB | #########5 | 95%  2025-05-07T19:45:55.8588082Z 2025-05-07T19:45:55.8588088Z 2025-05-07T19:45:55.8588092Z 2025-05-07T19:45:55.8588097Z 2025-05-07T19:45:55.8588117Z 2025-05-07T19:45:55.8664523Z cuda-nvvp-12.6.80 | 109.3 MB | #########2 | 93%  2025-05-07T19:45:55.8664847Z 2025-05-07T19:45:55.8664888Z 2025-05-07T19:45:55.8664893Z 2025-05-07T19:45:55.8664899Z 2025-05-07T19:45:55.8664919Z 2025-05-07T19:45:55.8665098Z 2025-05-07T19:45:55.8768055Z libcusolver-11.7.1.2 | 95.8 MB | ######6 | 67%  2025-05-07T19:45:55.8768416Z 2025-05-07T19:45:55.8904849Z libcublas-12.6.4.1 | 256.2 MB | #########8 | 98%  2025-05-07T19:45:55.9769670Z nsight-compute-2024. | 443.1 MB | #######7 | 77% 2025-05-07T19:45:55.9770126Z 2025-05-07T19:45:55.9770170Z 2025-05-07T19:45:55.9770176Z 2025-05-07T19:45:55.9770184Z 2025-05-07T19:45:55.9770212Z 2025-05-07T19:45:55.9770281Z 2025-05-07T19:45:55.9906205Z libcusolver-11.7.1.2 | 95.8 MB | #######3 | 73%  2025-05-07T19:45:56.0770346Z nsight-compute-2024. | 443.1 MB | #######9 | 80% 2025-05-07T19:45:56.0770683Z 2025-05-07T19:45:56.0770689Z 2025-05-07T19:45:56.0770694Z 2025-05-07T19:45:56.0770697Z 2025-05-07T19:45:56.0770701Z 2025-05-07T19:45:56.0770705Z 2025-05-07T19:45:56.0905813Z libcusolver-11.7.1.2 | 95.8 MB | ########1 | 81%  2025-05-07T19:45:56.1906820Z nsight-compute-2024. | 443.1 MB | ########2 | 83% 2025-05-07T19:45:56.2083559Z nsight-compute-2024. | 443.1 MB | ########6 | 87% 2025-05-07T19:45:56.2083946Z 2025-05-07T19:45:56.2084198Z 2025-05-07T19:45:56.2084214Z 2025-05-07T19:45:56.2084223Z 2025-05-07T19:45:56.2084267Z 2025-05-07T19:45:56.2084273Z 2025-05-07T19:45:56.2991423Z libcusolver-11.7.1.2 | 95.8 MB | ########8 | 88%  2025-05-07T19:45:56.3083720Z nsight-compute-2024. | 443.1 MB | ########9 | 90% 2025-05-07T19:45:56.3084073Z 2025-05-07T19:45:56.3084079Z 2025-05-07T19:45:56.3084083Z 2025-05-07T19:45:56.3084087Z 2025-05-07T19:45:56.3084091Z 2025-05-07T19:45:56.3084095Z 2025-05-07T19:45:56.4103559Z libcusolver-11.7.1.2 | 95.8 MB | #########7 | 97%  2025-05-07T19:45:56.4786456Z nsight-compute-2024. | 443.1 MB | #########2 | 93% 2025-05-07T19:45:56.4786771Z 2025-05-07T19:45:56.4786780Z 2025-05-07T19:45:56.5104570Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:56.5290342Z nsight-compute-2024. | 443.1 MB | #########5 | 96% 2025-05-07T19:45:56.5290649Z 2025-05-07T19:45:56.5290851Z 2025-05-07T19:45:56.5290864Z 2025-05-07T19:45:56.5291026Z 2025-05-07T19:45:56.5291036Z 2025-05-07T19:45:56.5291041Z 2025-05-07T19:45:56.5291301Z 2025-05-07T19:45:56.6180396Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:56.6312475Z nsight-compute-2024. | 443.1 MB | #########8 | 99% 2025-05-07T19:45:56.6312775Z 2025-05-07T19:45:56.6312825Z 2025-05-07T19:45:56.6312829Z 2025-05-07T19:45:56.6312833Z 2025-05-07T19:45:56.6312837Z 2025-05-07T19:45:56.6312840Z 2025-05-07T19:45:56.6312844Z 2025-05-07T19:45:56.7338828Z libnpp-12.3.1.54 | 93.4 MB | 7 | 7%  2025-05-07T19:45:56.7339172Z 2025-05-07T19:45:56.7339178Z 2025-05-07T19:45:56.7339182Z 2025-05-07T19:45:56.7339197Z 2025-05-07T19:45:56.7339201Z 2025-05-07T19:45:56.7339204Z 2025-05-07T19:45:56.7339208Z 2025-05-07T19:45:56.8191397Z libnpp-12.3.1.54 | 93.4 MB | #5 | 15%  2025-05-07T19:45:56.8191747Z 2025-05-07T19:45:56.8191752Z 2025-05-07T19:45:56.8191758Z 2025-05-07T19:45:56.8191761Z 2025-05-07T19:45:56.8191782Z 2025-05-07T19:45:56.8192095Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:56.8192404Z 2025-05-07T19:45:56.8192409Z 2025-05-07T19:45:56.8192413Z 2025-05-07T19:45:56.8192417Z 2025-05-07T19:45:56.8192421Z 2025-05-07T19:45:56.8361883Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:56.8362589Z 2025-05-07T19:45:56.8362615Z 2025-05-07T19:45:56.8362619Z 2025-05-07T19:45:56.8362634Z 2025-05-07T19:45:56.8362645Z 2025-05-07T19:45:56.8362650Z 2025-05-07T19:45:56.8362722Z 2025-05-07T19:45:56.8718081Z libnpp-12.3.1.54 | 93.4 MB | ##4 | 25%  2025-05-07T19:45:56.8718414Z 2025-05-07T19:45:56.8718422Z 2025-05-07T19:45:56.8718427Z 2025-05-07T19:45:56.8718432Z 2025-05-07T19:45:56.8718437Z 2025-05-07T19:45:56.8718444Z 2025-05-07T19:45:56.8718448Z 2025-05-07T19:45:56.8718453Z 2025-05-07T19:45:56.9719347Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:45:56.9719713Z 2025-05-07T19:45:56.9719718Z 2025-05-07T19:45:56.9719753Z 2025-05-07T19:45:56.9719756Z 2025-05-07T19:45:56.9719760Z 2025-05-07T19:45:56.9719764Z 2025-05-07T19:45:56.9719768Z 2025-05-07T19:45:56.9719784Z 2025-05-07T19:45:57.0071678Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##3 | 23%  2025-05-07T19:45:57.0072061Z 2025-05-07T19:45:57.0072066Z 2025-05-07T19:45:57.0072069Z 2025-05-07T19:45:57.0072073Z 2025-05-07T19:45:57.0072077Z 2025-05-07T19:45:57.0072080Z 2025-05-07T19:45:57.0072084Z 2025-05-07T19:45:57.0719586Z libnpp-12.3.1.54 | 93.4 MB | ###1 | 32%  2025-05-07T19:45:57.0719914Z 2025-05-07T19:45:57.0719922Z 2025-05-07T19:45:57.0719927Z 2025-05-07T19:45:57.0719932Z 2025-05-07T19:45:57.0719937Z 2025-05-07T19:45:57.0719944Z 2025-05-07T19:45:57.0719949Z 2025-05-07T19:45:57.0719961Z 2025-05-07T19:45:57.1100348Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###8 | 39%  2025-05-07T19:45:57.1100705Z 2025-05-07T19:45:57.1100710Z 2025-05-07T19:45:57.1100749Z 2025-05-07T19:45:57.1100753Z 2025-05-07T19:45:57.1100757Z 2025-05-07T19:45:57.1100761Z 2025-05-07T19:45:57.1100765Z 2025-05-07T19:45:57.1720461Z libnpp-12.3.1.54 | 93.4 MB | ###8 | 39%  2025-05-07T19:45:57.1720827Z 2025-05-07T19:45:57.1720831Z 2025-05-07T19:45:57.1720856Z 2025-05-07T19:45:57.1720860Z 2025-05-07T19:45:57.1720863Z 2025-05-07T19:45:57.1720867Z 2025-05-07T19:45:57.1720870Z 2025-05-07T19:45:57.1720875Z 2025-05-07T19:45:57.2399516Z cuda-nvdisasm-12.6.7 | 47.6 MB | #####8 | 58%  2025-05-07T19:45:57.2399867Z 2025-05-07T19:45:57.2399872Z 2025-05-07T19:45:57.2399876Z 2025-05-07T19:45:57.2399880Z 2025-05-07T19:45:57.2399887Z 2025-05-07T19:45:57.2399892Z 2025-05-07T19:45:57.2399898Z 2025-05-07T19:45:57.2587684Z libnpp-12.3.1.54 | 93.4 MB | ####4 | 45%  2025-05-07T19:45:57.2588007Z 2025-05-07T19:45:57.2588013Z 2025-05-07T19:45:57.2588018Z 2025-05-07T19:45:57.2720198Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:57.2720551Z 2025-05-07T19:45:57.2720556Z 2025-05-07T19:45:57.2720560Z 2025-05-07T19:45:57.2720564Z 2025-05-07T19:45:57.2720568Z 2025-05-07T19:45:57.2720572Z 2025-05-07T19:45:57.2720596Z 2025-05-07T19:45:57.2720599Z 2025-05-07T19:45:57.3006992Z cuda-nvdisasm-12.6.7 | 47.6 MB | #######9 | 79%  2025-05-07T19:45:57.3007373Z 2025-05-07T19:45:57.3007378Z 2025-05-07T19:45:57.3007382Z 2025-05-07T19:45:57.3007385Z 2025-05-07T19:45:57.3007389Z 2025-05-07T19:45:57.3007392Z 2025-05-07T19:45:57.3399855Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:45:57.3400215Z 2025-05-07T19:45:57.3400220Z 2025-05-07T19:45:57.3400225Z 2025-05-07T19:45:57.3400229Z 2025-05-07T19:45:57.3400233Z 2025-05-07T19:45:57.3400238Z 2025-05-07T19:45:57.3400243Z 2025-05-07T19:45:57.3593601Z libnpp-12.3.1.54 | 93.4 MB | #####1 | 51%  2025-05-07T19:45:57.3594041Z 2025-05-07T19:45:57.3594046Z 2025-05-07T19:45:57.3594051Z 2025-05-07T19:45:57.3594056Z 2025-05-07T19:45:57.3594061Z 2025-05-07T19:45:57.3594065Z 2025-05-07T19:45:57.3594070Z 2025-05-07T19:45:57.3594074Z 2025-05-07T19:45:57.3594079Z 2025-05-07T19:45:57.3825661Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:57.3826014Z 2025-05-07T19:45:57.3826018Z 2025-05-07T19:45:57.3826022Z 2025-05-07T19:45:57.3826026Z 2025-05-07T19:45:57.3826029Z 2025-05-07T19:45:57.3826033Z 2025-05-07T19:45:57.3826036Z 2025-05-07T19:45:57.3826041Z 2025-05-07T19:45:57.4400690Z cuda-nvdisasm-12.6.7 | 47.6 MB | #########7 | 97%  2025-05-07T19:45:57.4401082Z 2025-05-07T19:45:57.4401088Z 2025-05-07T19:45:57.4401092Z 2025-05-07T19:45:57.4401098Z 2025-05-07T19:45:57.4401104Z 2025-05-07T19:45:57.4401110Z 2025-05-07T19:45:57.4401116Z 2025-05-07T19:45:57.4596754Z libnpp-12.3.1.54 | 93.4 MB | #####8 | 59%  2025-05-07T19:45:57.4597129Z 2025-05-07T19:45:57.4597145Z 2025-05-07T19:45:57.4597150Z 2025-05-07T19:45:57.4597155Z 2025-05-07T19:45:57.4597159Z 2025-05-07T19:45:57.4597164Z 2025-05-07T19:45:57.4597169Z 2025-05-07T19:45:57.4597173Z 2025-05-07T19:45:57.4597209Z 2025-05-07T19:45:57.5401216Z libcurand-10.3.7.77 | 39.9 MB | #3 | 13%  2025-05-07T19:45:57.5401646Z 2025-05-07T19:45:57.5401650Z 2025-05-07T19:45:57.5401654Z 2025-05-07T19:45:57.5401658Z 2025-05-07T19:45:57.5401661Z 2025-05-07T19:45:57.5401664Z 2025-05-07T19:45:57.5401668Z 2025-05-07T19:45:57.5601729Z libnpp-12.3.1.54 | 93.4 MB | ######6 | 67%  2025-05-07T19:45:57.5602715Z 2025-05-07T19:45:57.5602730Z 2025-05-07T19:45:57.5602741Z 2025-05-07T19:45:57.5602752Z 2025-05-07T19:45:57.5602764Z 2025-05-07T19:45:57.5602775Z 2025-05-07T19:45:57.5602786Z 2025-05-07T19:45:57.5602797Z 2025-05-07T19:45:57.5602808Z 2025-05-07T19:45:57.6407014Z libcurand-10.3.7.77 | 39.9 MB | ##7 | 27%  2025-05-07T19:45:57.6407397Z 2025-05-07T19:45:57.6407402Z 2025-05-07T19:45:57.6407406Z 2025-05-07T19:45:57.6407411Z 2025-05-07T19:45:57.6407416Z 2025-05-07T19:45:57.6407420Z 2025-05-07T19:45:57.6407448Z 2025-05-07T19:45:57.6760422Z libnpp-12.3.1.54 | 93.4 MB | #######5 | 76%  2025-05-07T19:45:57.6760786Z 2025-05-07T19:45:57.6760791Z 2025-05-07T19:45:57.6760795Z 2025-05-07T19:45:57.6760798Z 2025-05-07T19:45:57.6760802Z 2025-05-07T19:45:57.6760805Z 2025-05-07T19:45:57.6760808Z 2025-05-07T19:45:57.6760812Z 2025-05-07T19:45:57.6760815Z 2025-05-07T19:45:57.7532974Z libcurand-10.3.7.77 | 39.9 MB | ###7 | 38%  2025-05-07T19:45:57.7533369Z 2025-05-07T19:45:57.7533377Z 2025-05-07T19:45:57.7533382Z 2025-05-07T19:45:57.7533386Z 2025-05-07T19:45:57.7533390Z 2025-05-07T19:45:57.7533393Z 2025-05-07T19:45:57.7533397Z 2025-05-07T19:45:57.7906644Z libnpp-12.3.1.54 | 93.4 MB | ########3 | 84%  2025-05-07T19:45:57.7907024Z 2025-05-07T19:45:57.7907029Z 2025-05-07T19:45:57.7907033Z 2025-05-07T19:45:57.7907037Z 2025-05-07T19:45:57.7907042Z 2025-05-07T19:45:57.7907047Z 2025-05-07T19:45:57.7907052Z 2025-05-07T19:45:57.7907085Z 2025-05-07T19:45:57.7907088Z 2025-05-07T19:45:57.8534401Z libcurand-10.3.7.77 | 39.9 MB | ####7 | 48%  2025-05-07T19:45:57.8534793Z 2025-05-07T19:45:57.8534799Z 2025-05-07T19:45:57.8534808Z 2025-05-07T19:45:57.8534813Z 2025-05-07T19:45:57.8534819Z 2025-05-07T19:45:57.8534825Z 2025-05-07T19:45:57.8534830Z 2025-05-07T19:45:57.9091038Z libnpp-12.3.1.54 | 93.4 MB | #########2 | 93%  2025-05-07T19:45:57.9091371Z 2025-05-07T19:45:57.9091377Z 2025-05-07T19:45:57.9091382Z 2025-05-07T19:45:57.9091386Z 2025-05-07T19:45:57.9091391Z 2025-05-07T19:45:57.9091396Z 2025-05-07T19:45:57.9091401Z 2025-05-07T19:45:57.9091406Z 2025-05-07T19:45:57.9091410Z 2025-05-07T19:45:58.0114018Z libcurand-10.3.7.77 | 39.9 MB | #####7 | 57%  2025-05-07T19:45:58.0114409Z 2025-05-07T19:45:58.0114414Z 2025-05-07T19:45:58.0114417Z 2025-05-07T19:45:58.0114422Z 2025-05-07T19:45:58.0114426Z 2025-05-07T19:45:58.0114678Z 2025-05-07T19:45:58.0114682Z 2025-05-07T19:45:58.0114685Z 2025-05-07T19:45:58.0114688Z 2025-05-07T19:45:58.0553973Z libcurand-10.3.7.77 | 39.9 MB | #######1 | 72%  2025-05-07T19:45:58.0554326Z 2025-05-07T19:45:58.0554349Z 2025-05-07T19:45:58.0554354Z 2025-05-07T19:45:58.0554359Z 2025-05-07T19:45:58.0554363Z 2025-05-07T19:45:58.0554368Z 2025-05-07T19:45:58.0554372Z 2025-05-07T19:45:58.0554377Z 2025-05-07T19:45:58.0951867Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:45:58.0952245Z 2025-05-07T19:45:58.0952251Z 2025-05-07T19:45:58.0952256Z 2025-05-07T19:45:58.0952261Z 2025-05-07T19:45:58.0952266Z 2025-05-07T19:45:58.0952270Z 2025-05-07T19:45:58.0952275Z 2025-05-07T19:45:58.0952312Z 2025-05-07T19:45:58.0952316Z 2025-05-07T19:45:58.0952320Z 2025-05-07T19:45:58.1114182Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:58.1114537Z 2025-05-07T19:45:58.1114564Z 2025-05-07T19:45:58.1114568Z 2025-05-07T19:45:58.1114571Z 2025-05-07T19:45:58.1114575Z 2025-05-07T19:45:58.1114578Z 2025-05-07T19:45:58.1114582Z 2025-05-07T19:45:58.1114585Z 2025-05-07T19:45:58.1114589Z 2025-05-07T19:45:58.1954719Z libcurand-10.3.7.77 | 39.9 MB | ########9 | 90%  2025-05-07T19:45:58.1955080Z 2025-05-07T19:45:58.1955103Z 2025-05-07T19:45:58.1955107Z 2025-05-07T19:45:58.1955111Z 2025-05-07T19:45:58.1955115Z 2025-05-07T19:45:58.1955119Z 2025-05-07T19:45:58.1955122Z 2025-05-07T19:45:58.1955125Z 2025-05-07T19:45:58.1955129Z 2025-05-07T19:45:58.1955132Z 2025-05-07T19:45:58.2955613Z gds-tools-1.11.1.6 | 37.8 MB | ## | 20%  2025-05-07T19:45:58.2955997Z 2025-05-07T19:45:58.2956028Z 2025-05-07T19:45:58.2956033Z 2025-05-07T19:45:58.2956037Z 2025-05-07T19:45:58.2956041Z 2025-05-07T19:45:58.2956045Z 2025-05-07T19:45:58.2956049Z 2025-05-07T19:45:58.2956053Z 2025-05-07T19:45:58.2956057Z 2025-05-07T19:45:58.2956080Z 2025-05-07T19:45:58.3956909Z gds-tools-1.11.1.6 | 37.8 MB | ####3 | 44%  2025-05-07T19:45:58.3957278Z 2025-05-07T19:45:58.3957305Z 2025-05-07T19:45:58.3957311Z 2025-05-07T19:45:58.3957317Z 2025-05-07T19:45:58.3957322Z 2025-05-07T19:45:58.3957327Z 2025-05-07T19:45:58.3957332Z 2025-05-07T19:45:58.3957336Z 2025-05-07T19:45:58.3957341Z 2025-05-07T19:45:58.3957346Z 2025-05-07T19:45:58.4957180Z gds-tools-1.11.1.6 | 37.8 MB | ######6 | 67%  2025-05-07T19:45:58.4957543Z 2025-05-07T19:45:58.4957548Z 2025-05-07T19:45:58.4957553Z 2025-05-07T19:45:58.4957556Z 2025-05-07T19:45:58.4957561Z 2025-05-07T19:45:58.4957564Z 2025-05-07T19:45:58.4957567Z 2025-05-07T19:45:58.4957571Z 2025-05-07T19:45:58.4957826Z 2025-05-07T19:45:58.4957831Z 2025-05-07T19:45:58.6827968Z gds-tools-1.11.1.6 | 37.8 MB | #########1 | 92%  2025-05-07T19:45:58.6828322Z 2025-05-07T19:45:58.6828327Z 2025-05-07T19:45:58.6828368Z 2025-05-07T19:45:58.6828371Z 2025-05-07T19:45:58.6828375Z 2025-05-07T19:45:58.6828379Z 2025-05-07T19:45:58.6828382Z 2025-05-07T19:45:58.6828386Z 2025-05-07T19:45:58.6828392Z 2025-05-07T19:45:58.7562163Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:45:58.7562512Z 2025-05-07T19:45:58.7562517Z 2025-05-07T19:45:58.7562522Z 2025-05-07T19:45:58.7562528Z 2025-05-07T19:45:58.7562532Z 2025-05-07T19:45:58.7562537Z 2025-05-07T19:45:58.7562542Z 2025-05-07T19:45:58.7562547Z 2025-05-07T19:45:58.7562552Z 2025-05-07T19:45:58.7562557Z 2025-05-07T19:45:58.7562561Z 2025-05-07T19:45:58.8564721Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:45:58.8565148Z 2025-05-07T19:45:58.8565153Z 2025-05-07T19:45:58.8565162Z 2025-05-07T19:45:58.8565166Z 2025-05-07T19:45:58.8565170Z 2025-05-07T19:45:58.8565174Z 2025-05-07T19:45:58.8565178Z 2025-05-07T19:45:58.8565182Z 2025-05-07T19:45:58.8565186Z 2025-05-07T19:45:58.8565430Z 2025-05-07T19:45:58.8565434Z 2025-05-07T19:45:58.8928467Z cuda-nvcc-tools-12.6 | 23.0 MB | ##9 | 29%  2025-05-07T19:45:58.8928826Z 2025-05-07T19:45:58.9324873Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:45:58.9325184Z 2025-05-07T19:45:58.9325189Z 2025-05-07T19:45:58.9325194Z 2025-05-07T19:45:58.9325199Z 2025-05-07T19:45:58.9325203Z 2025-05-07T19:45:58.9325220Z 2025-05-07T19:45:58.9325224Z 2025-05-07T19:45:58.9325227Z 2025-05-07T19:45:58.9325230Z 2025-05-07T19:45:58.9325234Z 2025-05-07T19:45:58.9325237Z 2025-05-07T19:45:58.9325241Z 2025-05-07T19:45:58.9566242Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:45:58.9566684Z 2025-05-07T19:45:58.9566690Z 2025-05-07T19:45:58.9566695Z 2025-05-07T19:45:58.9566699Z 2025-05-07T19:45:58.9566704Z 2025-05-07T19:45:58.9566709Z 2025-05-07T19:45:58.9566713Z 2025-05-07T19:45:58.9566718Z 2025-05-07T19:45:58.9566748Z 2025-05-07T19:45:58.9566778Z 2025-05-07T19:45:58.9566782Z 2025-05-07T19:45:58.9785448Z cuda-nvcc-tools-12.6 | 23.0 MB | ###### | 60%  2025-05-07T19:45:58.9785815Z 2025-05-07T19:45:58.9785996Z 2025-05-07T19:45:58.9786011Z 2025-05-07T19:45:58.9786018Z 2025-05-07T19:45:58.9786027Z 2025-05-07T19:45:58.9821609Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:58.9821981Z 2025-05-07T19:45:58.9821989Z 2025-05-07T19:45:58.9821996Z 2025-05-07T19:45:58.9822002Z 2025-05-07T19:45:58.9822009Z 2025-05-07T19:45:58.9822015Z 2025-05-07T19:45:58.9822022Z 2025-05-07T19:45:58.9822028Z 2025-05-07T19:45:58.9822035Z 2025-05-07T19:45:58.9822041Z 2025-05-07T19:45:59.0265251Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:45:59.0265631Z 2025-05-07T19:45:59.0265636Z 2025-05-07T19:45:59.0265640Z 2025-05-07T19:45:59.0265643Z 2025-05-07T19:45:59.0265647Z 2025-05-07T19:45:59.0265689Z 2025-05-07T19:45:59.0265692Z 2025-05-07T19:45:59.0265696Z 2025-05-07T19:45:59.0265699Z 2025-05-07T19:45:59.0265702Z 2025-05-07T19:45:59.0265706Z 2025-05-07T19:45:59.0265710Z 2025-05-07T19:45:59.0265713Z 2025-05-07T19:45:59.0325839Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:45:59.0326218Z 2025-05-07T19:45:59.0326222Z 2025-05-07T19:45:59.0326226Z 2025-05-07T19:45:59.0326229Z 2025-05-07T19:45:59.0326233Z 2025-05-07T19:45:59.0326236Z 2025-05-07T19:45:59.0326240Z 2025-05-07T19:45:59.0326243Z 2025-05-07T19:45:59.0326246Z 2025-05-07T19:45:59.0326250Z 2025-05-07T19:45:59.0326253Z 2025-05-07T19:45:59.0326257Z 2025-05-07T19:45:59.0796854Z cuda-nvrtc-12.6.85 | 17.3 MB | ####8 | 49%  2025-05-07T19:45:59.0797228Z 2025-05-07T19:45:59.0797233Z 2025-05-07T19:45:59.0797237Z 2025-05-07T19:45:59.0797241Z 2025-05-07T19:45:59.0797248Z 2025-05-07T19:45:59.0797251Z 2025-05-07T19:45:59.0797269Z 2025-05-07T19:45:59.0797273Z 2025-05-07T19:45:59.0797276Z 2025-05-07T19:45:59.0797279Z 2025-05-07T19:45:59.0797283Z 2025-05-07T19:45:59.1025661Z cuda-nvcc-tools-12.6 | 23.0 MB | ########3 | 83%  2025-05-07T19:45:59.1026056Z 2025-05-07T19:45:59.1026061Z 2025-05-07T19:45:59.1268058Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:59.1268941Z 2025-05-07T19:45:59.1268956Z 2025-05-07T19:45:59.1268966Z 2025-05-07T19:45:59.1268978Z 2025-05-07T19:45:59.1268989Z 2025-05-07T19:45:59.1268999Z 2025-05-07T19:45:59.1269009Z 2025-05-07T19:45:59.1269021Z 2025-05-07T19:45:59.1269032Z 2025-05-07T19:45:59.1269042Z 2025-05-07T19:45:59.1269053Z 2025-05-07T19:45:59.1269063Z 2025-05-07T19:45:59.1269108Z 2025-05-07T19:45:59.1328988Z libnvjitlink-12.6.85 | 14.9 MB | ###8 | 38%  2025-05-07T19:45:59.1329480Z 2025-05-07T19:45:59.1329486Z 2025-05-07T19:45:59.1329490Z 2025-05-07T19:45:59.1329494Z 2025-05-07T19:45:59.1329722Z 2025-05-07T19:45:59.1329725Z 2025-05-07T19:45:59.1329729Z 2025-05-07T19:45:59.1329732Z 2025-05-07T19:45:59.1329735Z 2025-05-07T19:45:59.1329739Z 2025-05-07T19:45:59.1329742Z 2025-05-07T19:45:59.1329746Z 2025-05-07T19:45:59.2109742Z cuda-nvrtc-12.6.85 | 17.3 MB | ########6 | 87%  2025-05-07T19:45:59.2110104Z 2025-05-07T19:45:59.2110200Z 2025-05-07T19:45:59.2110204Z 2025-05-07T19:45:59.2110232Z 2025-05-07T19:45:59.2110236Z 2025-05-07T19:45:59.2110319Z 2025-05-07T19:45:59.2110353Z 2025-05-07T19:45:59.2266715Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:45:59.2267104Z 2025-05-07T19:45:59.2267111Z 2025-05-07T19:45:59.2267116Z 2025-05-07T19:45:59.2267161Z 2025-05-07T19:45:59.2267166Z 2025-05-07T19:45:59.2267171Z 2025-05-07T19:45:59.2267175Z 2025-05-07T19:45:59.2267183Z 2025-05-07T19:45:59.2267186Z 2025-05-07T19:45:59.2267191Z 2025-05-07T19:45:59.2267195Z 2025-05-07T19:45:59.2267223Z 2025-05-07T19:45:59.2267258Z 2025-05-07T19:45:59.2633478Z libnvjitlink-12.6.85 | 14.9 MB | #########6 | 97%  2025-05-07T19:45:59.2633861Z 2025-05-07T19:45:59.2633866Z 2025-05-07T19:45:59.2633869Z 2025-05-07T19:45:59.2633873Z 2025-05-07T19:45:59.2633876Z 2025-05-07T19:45:59.2633880Z 2025-05-07T19:45:59.2633905Z 2025-05-07T19:45:59.2633908Z 2025-05-07T19:45:59.2633911Z 2025-05-07T19:45:59.2633915Z 2025-05-07T19:45:59.2633918Z 2025-05-07T19:45:59.2633922Z 2025-05-07T19:45:59.2633925Z 2025-05-07T19:45:59.2633928Z 2025-05-07T19:45:59.3634914Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:45:59.3635347Z 2025-05-07T19:45:59.3635351Z 2025-05-07T19:45:59.3635396Z 2025-05-07T19:45:59.3635400Z 2025-05-07T19:45:59.3635403Z 2025-05-07T19:45:59.3635407Z 2025-05-07T19:45:59.3635411Z 2025-05-07T19:45:59.3635415Z 2025-05-07T19:45:59.3635418Z 2025-05-07T19:45:59.3635423Z 2025-05-07T19:45:59.3635448Z 2025-05-07T19:45:59.3635451Z 2025-05-07T19:45:59.3635455Z 2025-05-07T19:45:59.3635458Z 2025-05-07T19:45:59.3649358Z cuda-nvcc-dev_linux- | 10.8 MB | ####1 | 42%  2025-05-07T19:45:59.3649772Z 2025-05-07T19:45:59.3649777Z 2025-05-07T19:45:59.3649782Z 2025-05-07T19:45:59.3649785Z 2025-05-07T19:45:59.3649789Z 2025-05-07T19:45:59.3649792Z 2025-05-07T19:45:59.3649796Z 2025-05-07T19:45:59.3649799Z 2025-05-07T19:45:59.3649802Z 2025-05-07T19:45:59.3649806Z 2025-05-07T19:45:59.3649809Z 2025-05-07T19:45:59.3649812Z 2025-05-07T19:45:59.3971655Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:45:59.3972018Z 2025-05-07T19:45:59.3972022Z 2025-05-07T19:45:59.3972284Z 2025-05-07T19:45:59.3972289Z 2025-05-07T19:45:59.3972293Z 2025-05-07T19:45:59.3972296Z 2025-05-07T19:45:59.3972299Z 2025-05-07T19:45:59.3972303Z 2025-05-07T19:45:59.3972306Z 2025-05-07T19:45:59.3972310Z 2025-05-07T19:45:59.3972324Z 2025-05-07T19:45:59.3972327Z 2025-05-07T19:45:59.3972358Z 2025-05-07T19:45:59.4184719Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:45:59.4185092Z 2025-05-07T19:45:59.4185118Z 2025-05-07T19:45:59.4185126Z 2025-05-07T19:45:59.4185136Z 2025-05-07T19:45:59.4185143Z 2025-05-07T19:45:59.4185149Z 2025-05-07T19:45:59.4185156Z 2025-05-07T19:45:59.4185163Z 2025-05-07T19:45:59.4185169Z 2025-05-07T19:45:59.4185176Z 2025-05-07T19:45:59.4185181Z 2025-05-07T19:45:59.4185186Z 2025-05-07T19:45:59.4185189Z 2025-05-07T19:45:59.4185193Z 2025-05-07T19:45:59.4205188Z 2025-05-07T19:45:59.4205792Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:45:59.4206191Z 2025-05-07T19:45:59.4206196Z 2025-05-07T19:45:59.4206201Z 2025-05-07T19:45:59.4206206Z 2025-05-07T19:45:59.4206211Z 2025-05-07T19:45:59.4206216Z 2025-05-07T19:45:59.4206236Z 2025-05-07T19:45:59.4206241Z 2025-05-07T19:45:59.4206482Z 2025-05-07T19:45:59.4206486Z 2025-05-07T19:45:59.4206489Z 2025-05-07T19:45:59.4448205Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:45:59.4448575Z 2025-05-07T19:45:59.4448580Z 2025-05-07T19:45:59.4448600Z 2025-05-07T19:45:59.4448605Z 2025-05-07T19:45:59.4448609Z 2025-05-07T19:45:59.4448614Z 2025-05-07T19:45:59.4448617Z 2025-05-07T19:45:59.4448622Z 2025-05-07T19:45:59.4448626Z 2025-05-07T19:45:59.4448631Z 2025-05-07T19:45:59.4448635Z 2025-05-07T19:45:59.4448639Z 2025-05-07T19:45:59.4448643Z 2025-05-07T19:45:59.4448647Z 2025-05-07T19:45:59.4448651Z 2025-05-07T19:45:59.4448655Z 2025-05-07T19:45:59.4631507Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:45:59.4631909Z 2025-05-07T19:45:59.4631913Z 2025-05-07T19:45:59.4631917Z 2025-05-07T19:45:59.4631920Z 2025-05-07T19:45:59.4631924Z 2025-05-07T19:45:59.4631927Z 2025-05-07T19:45:59.4631932Z 2025-05-07T19:45:59.4631951Z 2025-05-07T19:45:59.4756710Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:45:59.4757079Z 2025-05-07T19:45:59.4757083Z 2025-05-07T19:45:59.4757087Z 2025-05-07T19:45:59.4757091Z 2025-05-07T19:45:59.4757095Z 2025-05-07T19:45:59.4757098Z 2025-05-07T19:45:59.4757102Z 2025-05-07T19:45:59.4757105Z 2025-05-07T19:45:59.4757110Z 2025-05-07T19:45:59.4757114Z 2025-05-07T19:45:59.4757118Z 2025-05-07T19:45:59.4757121Z 2025-05-07T19:45:59.4757125Z 2025-05-07T19:45:59.4757128Z 2025-05-07T19:45:59.4757132Z 2025-05-07T19:45:59.4757135Z 2025-05-07T19:45:59.4760078Z 2025-05-07T19:45:59.5187543Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:45:59.5188685Z 2025-05-07T19:45:59.5188700Z 2025-05-07T19:45:59.5188747Z 2025-05-07T19:45:59.5188758Z 2025-05-07T19:45:59.5188768Z 2025-05-07T19:45:59.5188780Z 2025-05-07T19:45:59.5188791Z 2025-05-07T19:45:59.5188803Z 2025-05-07T19:45:59.5188841Z 2025-05-07T19:45:59.5188852Z 2025-05-07T19:45:59.5188862Z 2025-05-07T19:45:59.5188872Z 2025-05-07T19:45:59.5188882Z 2025-05-07T19:45:59.5188893Z 2025-05-07T19:45:59.5188904Z 2025-05-07T19:45:59.5450147Z cuda-nvvm-tools-12.6 | 10.4 MB | ######9 | 70%  2025-05-07T19:45:59.5450542Z 2025-05-07T19:45:59.5450546Z 2025-05-07T19:45:59.5450550Z 2025-05-07T19:45:59.5450554Z 2025-05-07T19:45:59.5450558Z 2025-05-07T19:45:59.5450561Z 2025-05-07T19:45:59.5450564Z 2025-05-07T19:45:59.5450568Z 2025-05-07T19:45:59.5450571Z 2025-05-07T19:45:59.5450575Z 2025-05-07T19:45:59.5450600Z 2025-05-07T19:45:59.5450603Z 2025-05-07T19:45:59.5450606Z 2025-05-07T19:45:59.5450610Z 2025-05-07T19:45:59.5450613Z 2025-05-07T19:45:59.5450928Z 2025-05-07T19:45:59.5675570Z cuda-sanitizer-api-1 | 8.9 MB | #########5 | 95%  2025-05-07T19:45:59.5675997Z 2025-05-07T19:45:59.5676002Z 2025-05-07T19:45:59.5676023Z 2025-05-07T19:45:59.5676026Z 2025-05-07T19:45:59.5676030Z 2025-05-07T19:45:59.5676033Z 2025-05-07T19:45:59.5676037Z 2025-05-07T19:45:59.5676040Z 2025-05-07T19:45:59.5676044Z 2025-05-07T19:45:59.5676047Z 2025-05-07T19:45:59.5676051Z 2025-05-07T19:45:59.5676054Z 2025-05-07T19:45:59.5676058Z 2025-05-07T19:45:59.5676061Z 2025-05-07T19:45:59.5676390Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:45:59.5676765Z 2025-05-07T19:45:59.5676769Z 2025-05-07T19:45:59.5676772Z 2025-05-07T19:45:59.5676776Z 2025-05-07T19:45:59.5676779Z 2025-05-07T19:45:59.5676783Z 2025-05-07T19:45:59.5676786Z 2025-05-07T19:45:59.5676790Z 2025-05-07T19:45:59.5676793Z 2025-05-07T19:45:59.5676796Z 2025-05-07T19:45:59.5676805Z 2025-05-07T19:45:59.5676809Z 2025-05-07T19:45:59.5676812Z 2025-05-07T19:45:59.5676815Z 2025-05-07T19:45:59.5758826Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:45:59.5759504Z 2025-05-07T19:45:59.5759508Z 2025-05-07T19:45:59.5759512Z 2025-05-07T19:45:59.5759516Z 2025-05-07T19:45:59.5759519Z 2025-05-07T19:45:59.5759523Z 2025-05-07T19:45:59.5759526Z 2025-05-07T19:45:59.5759529Z 2025-05-07T19:45:59.5759533Z 2025-05-07T19:45:59.5759536Z 2025-05-07T19:45:59.5759540Z 2025-05-07T19:45:59.5759543Z 2025-05-07T19:45:59.5759569Z 2025-05-07T19:45:59.5759572Z 2025-05-07T19:45:59.5759575Z 2025-05-07T19:45:59.5759579Z 2025-05-07T19:45:59.5759582Z 2025-05-07T19:45:59.6425880Z cuda-nvvm-impl-12.6. | 7.7 MB | ###9 | 39%  2025-05-07T19:45:59.6426285Z 2025-05-07T19:45:59.6426291Z 2025-05-07T19:45:59.6426319Z 2025-05-07T19:45:59.6426325Z 2025-05-07T19:45:59.6426330Z 2025-05-07T19:45:59.6426360Z 2025-05-07T19:45:59.6426365Z 2025-05-07T19:45:59.6426368Z 2025-05-07T19:45:59.6426372Z 2025-05-07T19:45:59.6426376Z 2025-05-07T19:45:59.6426381Z 2025-05-07T19:45:59.6426385Z 2025-05-07T19:45:59.6426388Z 2025-05-07T19:45:59.6426415Z 2025-05-07T19:45:59.6426419Z 2025-05-07T19:45:59.6426423Z 2025-05-07T19:45:59.6511295Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:45:59.6511722Z 2025-05-07T19:45:59.6511726Z 2025-05-07T19:45:59.6511730Z 2025-05-07T19:45:59.6511733Z 2025-05-07T19:45:59.6511737Z 2025-05-07T19:45:59.6511740Z 2025-05-07T19:45:59.6511743Z 2025-05-07T19:45:59.6511747Z 2025-05-07T19:45:59.6511750Z 2025-05-07T19:45:59.6511754Z 2025-05-07T19:45:59.6511757Z 2025-05-07T19:45:59.6511760Z 2025-05-07T19:45:59.6511764Z 2025-05-07T19:45:59.6511767Z 2025-05-07T19:45:59.6511771Z 2025-05-07T19:45:59.6511774Z 2025-05-07T19:45:59.6511806Z 2025-05-07T19:45:59.6511809Z 2025-05-07T19:45:59.6757444Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:45:59.6757838Z 2025-05-07T19:45:59.6757843Z 2025-05-07T19:45:59.6757847Z 2025-05-07T19:45:59.6757851Z 2025-05-07T19:45:59.6757867Z 2025-05-07T19:45:59.6757871Z 2025-05-07T19:45:59.6855276Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:45:59.6856265Z 2025-05-07T19:45:59.6856279Z 2025-05-07T19:45:59.6856289Z 2025-05-07T19:45:59.6856300Z 2025-05-07T19:45:59.6856310Z 2025-05-07T19:45:59.6856321Z 2025-05-07T19:45:59.6856331Z 2025-05-07T19:45:59.6856368Z 2025-05-07T19:45:59.6856379Z 2025-05-07T19:45:59.6856389Z 2025-05-07T19:45:59.6856399Z 2025-05-07T19:45:59.6856409Z 2025-05-07T19:45:59.6856419Z 2025-05-07T19:45:59.6856429Z 2025-05-07T19:45:59.6856439Z 2025-05-07T19:45:59.6856450Z 2025-05-07T19:45:59.6856461Z 2025-05-07T19:45:59.6856471Z 2025-05-07T19:45:59.6856481Z 2025-05-07T19:45:59.7073738Z ... (more hidden) ... 2025-05-07T19:45:59.7074124Z 2025-05-07T19:45:59.7074129Z 2025-05-07T19:45:59.7074134Z 2025-05-07T19:45:59.7074137Z 2025-05-07T19:45:59.7074141Z 2025-05-07T19:45:59.7074144Z 2025-05-07T19:45:59.7074157Z 2025-05-07T19:45:59.7074160Z 2025-05-07T19:45:59.7074164Z 2025-05-07T19:45:59.7074167Z 2025-05-07T19:45:59.7074170Z 2025-05-07T19:45:59.7074174Z 2025-05-07T19:45:59.7074177Z 2025-05-07T19:45:59.7074181Z 2025-05-07T19:45:59.7074184Z 2025-05-07T19:45:59.7074188Z 2025-05-07T19:45:59.7074191Z 2025-05-07T19:45:59.7074680Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:45:59.7075043Z 2025-05-07T19:45:59.7075059Z 2025-05-07T19:45:59.7075062Z 2025-05-07T19:45:59.7075066Z 2025-05-07T19:45:59.7075069Z 2025-05-07T19:45:59.7075072Z 2025-05-07T19:45:59.7075076Z 2025-05-07T19:45:59.7075079Z 2025-05-07T19:45:59.7075083Z 2025-05-07T19:45:59.7075108Z 2025-05-07T19:45:59.7075111Z 2025-05-07T19:45:59.7075119Z 2025-05-07T19:45:59.7075122Z 2025-05-07T19:45:59.7075126Z 2025-05-07T19:45:59.7075130Z 2025-05-07T19:45:59.7075133Z 2025-05-07T19:45:59.7075136Z 2025-05-07T19:45:59.7268356Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:45:59.7268992Z 2025-05-07T19:45:59.7268996Z 2025-05-07T19:45:59.7269000Z 2025-05-07T19:45:59.7269004Z 2025-05-07T19:45:59.7269007Z 2025-05-07T19:45:59.7269011Z 2025-05-07T19:45:59.7269014Z 2025-05-07T19:45:59.7269018Z 2025-05-07T19:45:59.7269021Z 2025-05-07T19:45:59.7269024Z 2025-05-07T19:45:59.7269028Z 2025-05-07T19:45:59.7269031Z 2025-05-07T19:45:59.7269035Z 2025-05-07T19:45:59.7269038Z 2025-05-07T19:45:59.7269042Z 2025-05-07T19:45:59.7345636Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:45:59.7346055Z 2025-05-07T19:45:59.7346059Z 2025-05-07T19:45:59.7346063Z 2025-05-07T19:45:59.7346066Z 2025-05-07T19:45:59.7346083Z 2025-05-07T19:45:59.7346087Z 2025-05-07T19:45:59.7346090Z 2025-05-07T19:45:59.7346094Z 2025-05-07T19:45:59.7346097Z 2025-05-07T19:45:59.7346101Z 2025-05-07T19:45:59.7346104Z 2025-05-07T19:45:59.7346108Z 2025-05-07T19:45:59.7346124Z 2025-05-07T19:45:59.7346147Z 2025-05-07T19:45:59.7346151Z 2025-05-07T19:45:59.7346155Z 2025-05-07T19:45:59.7346158Z 2025-05-07T19:45:59.7346161Z 2025-05-07T19:45:59.7388314Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:45:59.7388719Z 2025-05-07T19:45:59.7388748Z 2025-05-07T19:45:59.7388752Z 2025-05-07T19:45:59.7388756Z 2025-05-07T19:45:59.7388759Z 2025-05-07T19:45:59.7388763Z 2025-05-07T19:45:59.7388766Z 2025-05-07T19:45:59.7388770Z 2025-05-07T19:45:59.7388773Z 2025-05-07T19:45:59.7388777Z 2025-05-07T19:45:59.7388780Z 2025-05-07T19:45:59.7388784Z 2025-05-07T19:45:59.7388787Z 2025-05-07T19:45:59.7388791Z 2025-05-07T19:45:59.7388794Z 2025-05-07T19:45:59.7388798Z 2025-05-07T19:45:59.7388813Z 2025-05-07T19:45:59.7388816Z 2025-05-07T19:45:59.7388820Z 2025-05-07T19:45:59.8383471Z ... (more hidden) ... 2025-05-07T19:45:59.8383829Z 2025-05-07T19:45:59.8383862Z 2025-05-07T19:45:59.8383866Z 2025-05-07T19:45:59.8383871Z 2025-05-07T19:45:59.8383875Z 2025-05-07T19:45:59.8383880Z 2025-05-07T19:45:59.8383884Z 2025-05-07T19:45:59.8383889Z 2025-05-07T19:45:59.8383894Z 2025-05-07T19:46:00.0284515Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:00.0284888Z 2025-05-07T19:46:00.0284893Z 2025-05-07T19:46:00.0284899Z 2025-05-07T19:46:00.0284904Z 2025-05-07T19:46:00.0284907Z 2025-05-07T19:46:00.0284911Z 2025-05-07T19:46:00.0284914Z 2025-05-07T19:46:00.0284939Z 2025-05-07T19:46:00.0284942Z 2025-05-07T19:46:00.0284946Z 2025-05-07T19:46:00.3540936Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:00.3541369Z 2025-05-07T19:46:00.3541659Z 2025-05-07T19:46:00.3541665Z 2025-05-07T19:46:00.3541670Z 2025-05-07T19:46:00.3541700Z 2025-05-07T19:46:00.3541705Z 2025-05-07T19:46:00.3541710Z 2025-05-07T19:46:00.3541715Z 2025-05-07T19:46:00.3541719Z 2025-05-07T19:46:00.3541747Z 2025-05-07T19:46:00.3541750Z 2025-05-07T19:46:00.3541754Z 2025-05-07T19:46:00.6168497Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:00.6168906Z 2025-05-07T19:46:00.6168940Z 2025-05-07T19:46:00.6168946Z 2025-05-07T19:46:00.6168952Z 2025-05-07T19:46:00.6168958Z 2025-05-07T19:46:00.6168964Z 2025-05-07T19:46:00.6168992Z 2025-05-07T19:46:00.6168998Z 2025-05-07T19:46:00.6169006Z 2025-05-07T19:46:00.6169013Z 2025-05-07T19:46:00.6169019Z 2025-05-07T19:46:00.6169025Z 2025-05-07T19:46:00.6169030Z 2025-05-07T19:46:00.9973940Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:01.0348488Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:01.0348870Z 2025-05-07T19:46:01.0349004Z 2025-05-07T19:46:01.0349089Z 2025-05-07T19:46:01.0349094Z 2025-05-07T19:46:01.0349264Z 2025-05-07T19:46:01.0349342Z 2025-05-07T19:46:01.0349349Z 2025-05-07T19:46:01.0349644Z 2025-05-07T19:46:01.0349648Z 2025-05-07T19:46:01.0349653Z 2025-05-07T19:46:01.0349657Z 2025-05-07T19:46:01.0557151Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:01.0557557Z 2025-05-07T19:46:01.0557561Z 2025-05-07T19:46:01.0557565Z 2025-05-07T19:46:01.0557569Z 2025-05-07T19:46:01.0557573Z 2025-05-07T19:46:01.0557576Z 2025-05-07T19:46:01.0557579Z 2025-05-07T19:46:01.2116289Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:01.2116675Z 2025-05-07T19:46:01.2116679Z 2025-05-07T19:46:01.2116684Z 2025-05-07T19:46:01.2116687Z 2025-05-07T19:46:01.2116691Z 2025-05-07T19:46:01.2116694Z 2025-05-07T19:46:01.2116698Z 2025-05-07T19:46:01.2116701Z 2025-05-07T19:46:01.2116744Z 2025-05-07T19:46:01.2116747Z 2025-05-07T19:46:01.2116751Z 2025-05-07T19:46:01.2116754Z 2025-05-07T19:46:01.2116758Z 2025-05-07T19:46:01.2116761Z 2025-05-07T19:46:01.2116764Z 2025-05-07T19:46:01.2116768Z 2025-05-07T19:46:01.2402834Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:01.2403226Z 2025-05-07T19:46:01.2403365Z 2025-05-07T19:46:01.2403370Z 2025-05-07T19:46:01.2403392Z 2025-05-07T19:46:01.2403396Z 2025-05-07T19:46:01.2403418Z 2025-05-07T19:46:01.2403466Z 2025-05-07T19:46:01.2403470Z 2025-05-07T19:46:01.2403494Z 2025-05-07T19:46:01.2403499Z 2025-05-07T19:46:01.2403519Z 2025-05-07T19:46:01.2403546Z 2025-05-07T19:46:01.2403550Z 2025-05-07T19:46:01.2403578Z 2025-05-07T19:46:01.3151587Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:01.3152022Z 2025-05-07T19:46:01.3152027Z 2025-05-07T19:46:01.3152033Z 2025-05-07T19:46:01.3152076Z 2025-05-07T19:46:01.3152082Z 2025-05-07T19:46:01.3152085Z 2025-05-07T19:46:01.3152092Z 2025-05-07T19:46:01.3152097Z 2025-05-07T19:46:01.3152101Z 2025-05-07T19:46:01.3152106Z 2025-05-07T19:46:01.3152111Z 2025-05-07T19:46:01.3152114Z 2025-05-07T19:46:01.3152140Z 2025-05-07T19:46:01.3152144Z 2025-05-07T19:46:01.3152147Z 2025-05-07T19:46:01.3152151Z 2025-05-07T19:46:01.3152154Z 2025-05-07T19:46:01.3907614Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:01.3908038Z 2025-05-07T19:46:01.3908044Z 2025-05-07T19:46:01.3908049Z 2025-05-07T19:46:01.3908054Z 2025-05-07T19:46:01.3908060Z 2025-05-07T19:46:01.3908066Z 2025-05-07T19:46:01.3908072Z 2025-05-07T19:46:01.3908090Z 2025-05-07T19:46:01.3908095Z 2025-05-07T19:46:01.3908100Z 2025-05-07T19:46:01.3908104Z 2025-05-07T19:46:01.3908107Z 2025-05-07T19:46:01.3908110Z 2025-05-07T19:46:01.3908114Z 2025-05-07T19:46:01.3908117Z 2025-05-07T19:46:01.4234386Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:01.4234773Z 2025-05-07T19:46:01.4234777Z 2025-05-07T19:46:01.4234781Z 2025-05-07T19:46:01.4234785Z 2025-05-07T19:46:01.4234788Z 2025-05-07T19:46:01.4234808Z 2025-05-07T19:46:01.4234811Z 2025-05-07T19:46:01.4234815Z 2025-05-07T19:46:01.4234818Z 2025-05-07T19:46:01.4234821Z 2025-05-07T19:46:01.4234825Z 2025-05-07T19:46:01.4234829Z 2025-05-07T19:46:01.4234846Z 2025-05-07T19:46:01.4234849Z 2025-05-07T19:46:01.4234853Z 2025-05-07T19:46:01.4234856Z 2025-05-07T19:46:01.4234859Z 2025-05-07T19:46:01.4234879Z 2025-05-07T19:46:01.4235229Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:01.4235581Z 2025-05-07T19:46:01.4235599Z 2025-05-07T19:46:01.4235602Z 2025-05-07T19:46:01.4235606Z 2025-05-07T19:46:01.4235610Z 2025-05-07T19:46:01.4235613Z 2025-05-07T19:46:01.4235616Z 2025-05-07T19:46:01.4235620Z 2025-05-07T19:46:01.4235627Z 2025-05-07T19:46:01.4235631Z 2025-05-07T19:46:01.4235634Z 2025-05-07T19:46:01.4235637Z 2025-05-07T19:46:01.4235641Z 2025-05-07T19:46:01.4235644Z 2025-05-07T19:46:01.4235647Z 2025-05-07T19:46:01.4235651Z 2025-05-07T19:46:01.4235654Z 2025-05-07T19:46:01.4235813Z 2025-05-07T19:46:01.4536140Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:01.4536537Z 2025-05-07T19:46:01.4536542Z 2025-05-07T19:46:01.4536545Z 2025-05-07T19:46:01.4536549Z 2025-05-07T19:46:01.4536552Z 2025-05-07T19:46:01.4536556Z 2025-05-07T19:46:01.4536559Z 2025-05-07T19:46:01.4536564Z 2025-05-07T19:46:01.4536568Z 2025-05-07T19:46:01.4536571Z 2025-05-07T19:46:01.4536575Z 2025-05-07T19:46:01.4536578Z 2025-05-07T19:46:01.4536581Z 2025-05-07T19:46:01.4536600Z 2025-05-07T19:46:01.4536603Z 2025-05-07T19:46:01.4536606Z 2025-05-07T19:46:01.4536610Z 2025-05-07T19:46:01.4536613Z 2025-05-07T19:46:01.4536616Z 2025-05-07T19:46:01.4536887Z ... (more hidden) ... 2025-05-07T19:46:01.4537187Z 2025-05-07T19:46:01.4537190Z 2025-05-07T19:46:01.4537194Z 2025-05-07T19:46:01.4537197Z 2025-05-07T19:46:01.4537217Z 2025-05-07T19:46:01.4537220Z 2025-05-07T19:46:01.4537232Z 2025-05-07T19:46:01.4537235Z 2025-05-07T19:46:01.4537250Z 2025-05-07T19:46:01.4537253Z 2025-05-07T19:46:01.4537257Z 2025-05-07T19:46:01.4537260Z 2025-05-07T19:46:01.4537264Z 2025-05-07T19:46:01.4537267Z 2025-05-07T19:46:01.4537271Z 2025-05-07T19:46:01.4537274Z 2025-05-07T19:46:01.4537278Z 2025-05-07T19:46:01.4537281Z 2025-05-07T19:46:01.4537284Z 2025-05-07T19:46:03.1729716Z ... (more hidden) ... 2025-05-07T19:46:03.1730709Z 2025-05-07T19:46:05.7215546Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:05.7218974Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:05.7219258Z 2025-05-07T19:46:05.7219263Z 2025-05-07T19:46:05.7219299Z 2025-05-07T19:46:05.7219303Z 2025-05-07T19:46:05.7219307Z 2025-05-07T19:46:05.7219310Z 2025-05-07T19:46:05.7219314Z 2025-05-07T19:46:05.7219317Z 2025-05-07T19:46:05.7219321Z 2025-05-07T19:46:05.7219336Z 2025-05-07T19:46:05.7219361Z 2025-05-07T19:46:05.7219365Z 2025-05-07T19:46:05.7219368Z 2025-05-07T19:46:05.7219372Z 2025-05-07T19:46:05.7219375Z 2025-05-07T19:46:05.7219378Z 2025-05-07T19:46:05.7219382Z 2025-05-07T19:46:05.7219385Z 2025-05-07T19:46:05.7219389Z 2025-05-07T19:46:05.7219494Z 2025-05-07T19:46:05.7219843Z  2025-05-07T19:46:05.7220193Z 2025-05-07T19:46:05.7220405Z 2025-05-07T19:46:05.7220596Z  2025-05-07T19:46:05.7220814Z 2025-05-07T19:46:05.7220818Z 2025-05-07T19:46:05.7220988Z  2025-05-07T19:46:05.7221206Z 2025-05-07T19:46:05.7221461Z 2025-05-07T19:46:05.7221467Z 2025-05-07T19:46:05.7221669Z  2025-05-07T19:46:05.7221894Z 2025-05-07T19:46:05.7221898Z 2025-05-07T19:46:05.7221902Z 2025-05-07T19:46:05.7221923Z 2025-05-07T19:46:05.7222116Z  2025-05-07T19:46:05.7222339Z 2025-05-07T19:46:05.7222342Z 2025-05-07T19:46:05.7222346Z 2025-05-07T19:46:05.7222349Z 2025-05-07T19:46:05.7222352Z 2025-05-07T19:46:05.7222532Z  2025-05-07T19:46:05.7222778Z 2025-05-07T19:46:05.7222782Z 2025-05-07T19:46:05.7222785Z 2025-05-07T19:46:05.7222789Z 2025-05-07T19:46:05.7222792Z 2025-05-07T19:46:05.7222795Z 2025-05-07T19:46:05.7223001Z  2025-05-07T19:46:05.7223249Z 2025-05-07T19:46:05.7223253Z 2025-05-07T19:46:05.7223257Z 2025-05-07T19:46:05.7223260Z 2025-05-07T19:46:05.7223269Z 2025-05-07T19:46:05.7223272Z 2025-05-07T19:46:05.7223276Z 2025-05-07T19:46:05.7223462Z  2025-05-07T19:46:05.7223710Z 2025-05-07T19:46:05.7223714Z 2025-05-07T19:46:05.7223843Z 2025-05-07T19:46:05.7223846Z 2025-05-07T19:46:05.7223850Z 2025-05-07T19:46:05.7223853Z 2025-05-07T19:46:05.7223856Z 2025-05-07T19:46:05.7223860Z 2025-05-07T19:46:05.7224049Z  2025-05-07T19:46:05.7224301Z 2025-05-07T19:46:05.7224304Z 2025-05-07T19:46:05.7224308Z 2025-05-07T19:46:05.7224311Z 2025-05-07T19:46:05.7224315Z 2025-05-07T19:46:05.7224318Z 2025-05-07T19:46:05.7224321Z 2025-05-07T19:46:05.7224325Z 2025-05-07T19:46:05.7224328Z 2025-05-07T19:46:05.7224577Z  2025-05-07T19:46:05.7224846Z 2025-05-07T19:46:05.7224850Z 2025-05-07T19:46:05.7224853Z 2025-05-07T19:46:05.7224856Z 2025-05-07T19:46:05.7224864Z 2025-05-07T19:46:05.7224868Z 2025-05-07T19:46:05.7224871Z 2025-05-07T19:46:05.7224875Z 2025-05-07T19:46:05.7224878Z 2025-05-07T19:46:05.7224882Z 2025-05-07T19:46:05.7225078Z  2025-05-07T19:46:05.7225339Z 2025-05-07T19:46:05.7225343Z 2025-05-07T19:46:05.7225346Z 2025-05-07T19:46:05.7225350Z 2025-05-07T19:46:05.7225353Z 2025-05-07T19:46:05.7225356Z 2025-05-07T19:46:05.7225360Z 2025-05-07T19:46:05.7225363Z 2025-05-07T19:46:05.7225367Z 2025-05-07T19:46:05.7225370Z 2025-05-07T19:46:05.7225373Z 2025-05-07T19:46:05.7225579Z  2025-05-07T19:46:05.7225841Z 2025-05-07T19:46:05.7225845Z 2025-05-07T19:46:05.7225848Z 2025-05-07T19:46:05.7225852Z 2025-05-07T19:46:05.7225855Z 2025-05-07T19:46:05.7225859Z 2025-05-07T19:46:05.7225862Z 2025-05-07T19:46:05.7225866Z 2025-05-07T19:46:05.7225869Z 2025-05-07T19:46:05.7225876Z 2025-05-07T19:46:05.7225880Z 2025-05-07T19:46:05.7225883Z 2025-05-07T19:46:05.7226103Z  2025-05-07T19:46:05.7226348Z 2025-05-07T19:46:05.7226351Z 2025-05-07T19:46:05.7226359Z 2025-05-07T19:46:05.7226362Z 2025-05-07T19:46:05.7226365Z 2025-05-07T19:46:05.7226369Z 2025-05-07T19:46:05.7226372Z 2025-05-07T19:46:05.7226376Z 2025-05-07T19:46:05.7226379Z 2025-05-07T19:46:05.7226382Z 2025-05-07T19:46:05.7226386Z 2025-05-07T19:46:05.7226390Z 2025-05-07T19:46:05.7226393Z 2025-05-07T19:46:05.7226615Z  2025-05-07T19:46:05.7226857Z 2025-05-07T19:46:05.7226861Z 2025-05-07T19:46:05.7226864Z 2025-05-07T19:46:05.7226868Z 2025-05-07T19:46:05.7226871Z 2025-05-07T19:46:05.7226874Z 2025-05-07T19:46:05.7226878Z 2025-05-07T19:46:05.7226881Z 2025-05-07T19:46:05.7226884Z 2025-05-07T19:46:05.7226888Z 2025-05-07T19:46:05.7226945Z 2025-05-07T19:46:05.7226950Z 2025-05-07T19:46:05.7226966Z 2025-05-07T19:46:05.7226969Z 2025-05-07T19:46:05.7227185Z  2025-05-07T19:46:05.7227436Z 2025-05-07T19:46:05.7227439Z 2025-05-07T19:46:05.7227443Z 2025-05-07T19:46:05.7227446Z 2025-05-07T19:46:05.7227449Z 2025-05-07T19:46:05.7227453Z 2025-05-07T19:46:05.7227456Z 2025-05-07T19:46:05.7227473Z 2025-05-07T19:46:05.7227476Z 2025-05-07T19:46:05.7227479Z 2025-05-07T19:46:05.7227501Z 2025-05-07T19:46:05.7227505Z 2025-05-07T19:46:05.7227508Z 2025-05-07T19:46:05.7227512Z 2025-05-07T19:46:05.7227515Z 2025-05-07T19:46:05.7227729Z  2025-05-07T19:46:05.7227997Z 2025-05-07T19:46:05.7228000Z 2025-05-07T19:46:05.7228004Z 2025-05-07T19:46:05.7228007Z 2025-05-07T19:46:05.7228011Z 2025-05-07T19:46:05.7228014Z 2025-05-07T19:46:05.7228017Z 2025-05-07T19:46:05.7228024Z 2025-05-07T19:46:05.7228028Z 2025-05-07T19:46:05.7228031Z 2025-05-07T19:46:05.7228035Z 2025-05-07T19:46:05.7228038Z 2025-05-07T19:46:05.7228041Z 2025-05-07T19:46:05.7228045Z 2025-05-07T19:46:05.7228048Z 2025-05-07T19:46:05.7228105Z 2025-05-07T19:46:05.7228338Z  2025-05-07T19:46:05.7228591Z 2025-05-07T19:46:05.7228594Z 2025-05-07T19:46:05.7228598Z 2025-05-07T19:46:05.7228601Z 2025-05-07T19:46:05.7228604Z 2025-05-07T19:46:05.7228608Z 2025-05-07T19:46:05.7228611Z 2025-05-07T19:46:05.7228614Z 2025-05-07T19:46:05.7228618Z 2025-05-07T19:46:05.7228621Z 2025-05-07T19:46:05.7228624Z 2025-05-07T19:46:05.7228628Z 2025-05-07T19:46:05.7228631Z 2025-05-07T19:46:05.7228649Z 2025-05-07T19:46:05.7228652Z 2025-05-07T19:46:05.7228655Z 2025-05-07T19:46:05.7228659Z 2025-05-07T19:46:05.7228889Z  2025-05-07T19:46:05.7229144Z 2025-05-07T19:46:05.7229148Z 2025-05-07T19:46:05.7229152Z 2025-05-07T19:46:05.7229155Z 2025-05-07T19:46:05.7229158Z 2025-05-07T19:46:05.7229176Z 2025-05-07T19:46:05.7229179Z 2025-05-07T19:46:05.7229183Z 2025-05-07T19:46:05.7229190Z 2025-05-07T19:46:05.7229194Z 2025-05-07T19:46:05.7229197Z 2025-05-07T19:46:05.7229201Z 2025-05-07T19:46:05.7229204Z 2025-05-07T19:46:05.7229208Z 2025-05-07T19:46:05.7229211Z 2025-05-07T19:46:05.7229214Z 2025-05-07T19:46:05.7229218Z 2025-05-07T19:46:05.7229221Z 2025-05-07T19:46:05.7229551Z  2025-05-07T19:46:05.7229827Z 2025-05-07T19:46:05.7229830Z 2025-05-07T19:46:05.7229935Z  2025-05-07T19:46:05.7230041Z 2025-05-07T19:46:05.7230044Z 2025-05-07T19:46:05.7230158Z  2025-05-07T19:46:05.7230268Z 2025-05-07T19:46:05.7230272Z 2025-05-07T19:46:05.7230276Z 2025-05-07T19:46:05.7230376Z  2025-05-07T19:46:05.7230510Z 2025-05-07T19:46:05.7230513Z 2025-05-07T19:46:05.7230517Z 2025-05-07T19:46:05.7230520Z 2025-05-07T19:46:05.7230646Z  2025-05-07T19:46:05.7230781Z 2025-05-07T19:46:05.7230785Z 2025-05-07T19:46:05.7230788Z 2025-05-07T19:46:05.7230796Z 2025-05-07T19:46:05.7230799Z 2025-05-07T19:46:05.7230905Z  2025-05-07T19:46:05.7231031Z 2025-05-07T19:46:05.7231034Z 2025-05-07T19:46:05.7231038Z 2025-05-07T19:46:05.7231041Z 2025-05-07T19:46:05.7231044Z 2025-05-07T19:46:05.7231048Z 2025-05-07T19:46:05.7231171Z  2025-05-07T19:46:05.7231305Z 2025-05-07T19:46:05.7231308Z 2025-05-07T19:46:05.7231312Z 2025-05-07T19:46:05.7231315Z 2025-05-07T19:46:05.7231319Z 2025-05-07T19:46:05.7231322Z 2025-05-07T19:46:05.7231325Z 2025-05-07T19:46:05.7231453Z  2025-05-07T19:46:05.7231597Z 2025-05-07T19:46:05.7231601Z 2025-05-07T19:46:05.7231604Z 2025-05-07T19:46:05.7231608Z 2025-05-07T19:46:05.7231611Z 2025-05-07T19:46:05.7231615Z 2025-05-07T19:46:05.7231680Z 2025-05-07T19:46:05.7231685Z 2025-05-07T19:46:05.7231810Z  2025-05-07T19:46:05.7231981Z 2025-05-07T19:46:05.7231985Z 2025-05-07T19:46:05.7231988Z 2025-05-07T19:46:05.7231992Z 2025-05-07T19:46:05.7231996Z 2025-05-07T19:46:05.7232003Z 2025-05-07T19:46:05.7232006Z 2025-05-07T19:46:05.7232010Z 2025-05-07T19:46:05.7232013Z 2025-05-07T19:46:05.7232136Z  2025-05-07T19:46:05.7232315Z 2025-05-07T19:46:05.7232318Z 2025-05-07T19:46:05.7232322Z 2025-05-07T19:46:05.7232325Z 2025-05-07T19:46:05.7232328Z 2025-05-07T19:46:05.7232332Z 2025-05-07T19:46:05.7232335Z 2025-05-07T19:46:05.7232338Z 2025-05-07T19:46:05.7232342Z 2025-05-07T19:46:05.7232345Z 2025-05-07T19:46:05.7232477Z  2025-05-07T19:46:05.7232663Z 2025-05-07T19:46:05.7232667Z 2025-05-07T19:46:05.7232670Z 2025-05-07T19:46:05.7232673Z 2025-05-07T19:46:05.7232677Z 2025-05-07T19:46:05.7232680Z 2025-05-07T19:46:05.7232683Z 2025-05-07T19:46:05.7232687Z 2025-05-07T19:46:05.7232694Z 2025-05-07T19:46:05.7232697Z 2025-05-07T19:46:05.7232701Z 2025-05-07T19:46:05.7232832Z  2025-05-07T19:46:05.7233033Z 2025-05-07T19:46:05.7233037Z 2025-05-07T19:46:05.7233040Z 2025-05-07T19:46:05.7233098Z 2025-05-07T19:46:05.7233102Z 2025-05-07T19:46:05.7233105Z 2025-05-07T19:46:05.7233108Z 2025-05-07T19:46:05.7233112Z 2025-05-07T19:46:05.7233115Z 2025-05-07T19:46:05.7233119Z 2025-05-07T19:46:05.7233122Z 2025-05-07T19:46:05.7233126Z 2025-05-07T19:46:05.7233259Z  2025-05-07T19:46:05.7233462Z 2025-05-07T19:46:05.7233466Z 2025-05-07T19:46:05.7233469Z 2025-05-07T19:46:05.7233472Z 2025-05-07T19:46:05.7233476Z 2025-05-07T19:46:05.7233479Z 2025-05-07T19:46:05.7233483Z 2025-05-07T19:46:05.7233486Z 2025-05-07T19:46:05.7233489Z 2025-05-07T19:46:05.7233493Z 2025-05-07T19:46:05.7233496Z 2025-05-07T19:46:05.7233500Z 2025-05-07T19:46:05.7233503Z 2025-05-07T19:46:05.7233661Z  2025-05-07T19:46:05.7233860Z 2025-05-07T19:46:05.7233864Z 2025-05-07T19:46:05.7233868Z 2025-05-07T19:46:05.7233871Z 2025-05-07T19:46:05.7233875Z 2025-05-07T19:46:05.7233878Z 2025-05-07T19:46:05.7233881Z 2025-05-07T19:46:05.7233888Z 2025-05-07T19:46:05.7233892Z 2025-05-07T19:46:05.7233895Z 2025-05-07T19:46:05.7233899Z 2025-05-07T19:46:05.7233902Z 2025-05-07T19:46:05.7233905Z 2025-05-07T19:46:05.7233909Z 2025-05-07T19:46:05.7234087Z  2025-05-07T19:46:05.7234292Z 2025-05-07T19:46:05.7234295Z 2025-05-07T19:46:05.7234299Z 2025-05-07T19:46:05.7234302Z 2025-05-07T19:46:05.7234306Z 2025-05-07T19:46:05.7234309Z 2025-05-07T19:46:05.7234326Z 2025-05-07T19:46:05.7234330Z 2025-05-07T19:46:05.7234333Z 2025-05-07T19:46:05.7234336Z 2025-05-07T19:46:05.7234340Z 2025-05-07T19:46:05.7234343Z 2025-05-07T19:46:05.7234347Z 2025-05-07T19:46:05.7234350Z 2025-05-07T19:46:05.7234353Z 2025-05-07T19:46:05.7234502Z  2025-05-07T19:46:05.7234714Z 2025-05-07T19:46:05.7234718Z 2025-05-07T19:46:05.7234736Z 2025-05-07T19:46:05.7234739Z 2025-05-07T19:46:05.7234743Z 2025-05-07T19:46:05.7234746Z 2025-05-07T19:46:05.7234750Z 2025-05-07T19:46:05.7234753Z 2025-05-07T19:46:05.7234760Z 2025-05-07T19:46:05.7234763Z 2025-05-07T19:46:05.7234766Z 2025-05-07T19:46:05.7234770Z 2025-05-07T19:46:05.7234773Z 2025-05-07T19:46:05.7234776Z 2025-05-07T19:46:05.7234779Z 2025-05-07T19:46:05.7234783Z 2025-05-07T19:46:05.7234937Z  2025-05-07T19:46:05.7235169Z 2025-05-07T19:46:05.7235173Z 2025-05-07T19:46:05.7235176Z 2025-05-07T19:46:05.7235179Z 2025-05-07T19:46:05.7235183Z 2025-05-07T19:46:05.7235187Z 2025-05-07T19:46:05.7235190Z 2025-05-07T19:46:05.7235193Z 2025-05-07T19:46:05.7235197Z 2025-05-07T19:46:05.7235201Z 2025-05-07T19:46:05.7235204Z 2025-05-07T19:46:05.7235207Z 2025-05-07T19:46:05.7235211Z 2025-05-07T19:46:05.7235214Z 2025-05-07T19:46:05.7235217Z 2025-05-07T19:46:05.7236082Z 2025-05-07T19:46:05.7236087Z 2025-05-07T19:46:05.7236277Z  2025-05-07T19:46:05.7236502Z 2025-05-07T19:46:05.7236505Z 2025-05-07T19:46:05.7236509Z 2025-05-07T19:46:05.7236512Z 2025-05-07T19:46:05.7236519Z 2025-05-07T19:46:05.7236523Z 2025-05-07T19:46:05.7236526Z 2025-05-07T19:46:05.7236529Z 2025-05-07T19:46:05.7236533Z 2025-05-07T19:46:05.7236536Z 2025-05-07T19:46:05.7236539Z 2025-05-07T19:46:05.7236543Z 2025-05-07T19:46:05.7236546Z 2025-05-07T19:46:05.7236550Z 2025-05-07T19:46:05.7236572Z 2025-05-07T19:46:05.7236576Z 2025-05-07T19:46:05.7236579Z 2025-05-07T19:46:05.7236583Z 2025-05-07T19:46:05.7236755Z  2025-05-07T19:46:05.7236983Z 2025-05-07T19:46:05.7236987Z 2025-05-07T19:46:05.7237103Z  2025-05-07T19:46:05.7237211Z 2025-05-07T19:46:05.7237215Z 2025-05-07T19:46:05.7237314Z  2025-05-07T19:46:05.7237424Z 2025-05-07T19:46:05.7237443Z 2025-05-07T19:46:05.7237450Z 2025-05-07T19:46:05.7237551Z  2025-05-07T19:46:05.7237663Z 2025-05-07T19:46:05.7237666Z 2025-05-07T19:46:05.7237670Z 2025-05-07T19:46:05.7237673Z 2025-05-07T19:46:05.7237791Z  2025-05-07T19:46:05.7237911Z 2025-05-07T19:46:05.7237969Z 2025-05-07T19:46:05.7237972Z 2025-05-07T19:46:05.7237975Z 2025-05-07T19:46:05.7237979Z 2025-05-07T19:46:05.7238085Z  2025-05-07T19:46:05.7238231Z 2025-05-07T19:46:05.7238235Z 2025-05-07T19:46:05.7238238Z 2025-05-07T19:46:05.7238242Z 2025-05-07T19:46:05.7238245Z 2025-05-07T19:46:05.7238249Z 2025-05-07T19:46:05.7238357Z  2025-05-07T19:46:05.7238490Z 2025-05-07T19:46:05.7238494Z 2025-05-07T19:46:05.7238497Z 2025-05-07T19:46:05.7238501Z 2025-05-07T19:46:05.7238518Z 2025-05-07T19:46:05.7238522Z 2025-05-07T19:46:05.7238525Z 2025-05-07T19:46:05.7238700Z  2025-05-07T19:46:05.7238856Z 2025-05-07T19:46:05.7238860Z 2025-05-07T19:46:05.7238864Z 2025-05-07T19:46:05.7238867Z 2025-05-07T19:46:05.7238874Z 2025-05-07T19:46:05.7238878Z 2025-05-07T19:46:05.7238881Z 2025-05-07T19:46:05.7238885Z 2025-05-07T19:46:05.7239002Z  2025-05-07T19:46:05.7239168Z 2025-05-07T19:46:05.7239171Z 2025-05-07T19:46:05.7239178Z 2025-05-07T19:46:05.7239182Z 2025-05-07T19:46:05.7239185Z 2025-05-07T19:46:05.7239188Z 2025-05-07T19:46:05.7239192Z 2025-05-07T19:46:05.7239195Z 2025-05-07T19:46:05.7239199Z 2025-05-07T19:46:05.7239323Z  2025-05-07T19:46:05.7239498Z 2025-05-07T19:46:05.7239502Z 2025-05-07T19:46:05.7239505Z 2025-05-07T19:46:05.7239509Z 2025-05-07T19:46:05.7239512Z 2025-05-07T19:46:05.7239515Z 2025-05-07T19:46:05.7239519Z 2025-05-07T19:46:05.7239522Z 2025-05-07T19:46:05.7239525Z 2025-05-07T19:46:05.7239529Z 2025-05-07T19:46:05.7239654Z  2025-05-07T19:46:05.7239836Z 2025-05-07T19:46:05.7239839Z 2025-05-07T19:46:05.7239843Z 2025-05-07T19:46:05.7239846Z 2025-05-07T19:46:05.7239850Z 2025-05-07T19:46:05.7239856Z 2025-05-07T19:46:05.7239860Z 2025-05-07T19:46:05.7239864Z 2025-05-07T19:46:05.7239867Z 2025-05-07T19:46:05.7239870Z 2025-05-07T19:46:05.7239874Z 2025-05-07T19:46:05.7240009Z  2025-05-07T19:46:05.7240220Z 2025-05-07T19:46:05.7240227Z 2025-05-07T19:46:05.7240231Z 2025-05-07T19:46:05.7240234Z 2025-05-07T19:46:05.7240237Z 2025-05-07T19:46:05.7240241Z 2025-05-07T19:46:05.7240244Z 2025-05-07T19:46:05.7240247Z 2025-05-07T19:46:05.7240251Z 2025-05-07T19:46:05.7240254Z 2025-05-07T19:46:05.7240257Z 2025-05-07T19:46:05.7240261Z 2025-05-07T19:46:05.7240396Z  2025-05-07T19:46:05.7240607Z 2025-05-07T19:46:05.7240611Z 2025-05-07T19:46:05.7240614Z 2025-05-07T19:46:05.7240617Z 2025-05-07T19:46:05.7240621Z 2025-05-07T19:46:05.7240624Z 2025-05-07T19:46:05.7240627Z 2025-05-07T19:46:05.7240630Z 2025-05-07T19:46:05.7240634Z 2025-05-07T19:46:05.7240637Z 2025-05-07T19:46:05.7240641Z 2025-05-07T19:46:05.7240644Z 2025-05-07T19:46:05.7240701Z 2025-05-07T19:46:05.7240840Z  2025-05-07T19:46:05.7241059Z 2025-05-07T19:46:05.7241063Z 2025-05-07T19:46:05.7241066Z 2025-05-07T19:46:05.7241069Z 2025-05-07T19:46:05.7241073Z 2025-05-07T19:46:05.7241080Z 2025-05-07T19:46:05.7241084Z 2025-05-07T19:46:05.7241087Z 2025-05-07T19:46:05.7241090Z 2025-05-07T19:46:05.7241094Z 2025-05-07T19:46:05.7241097Z 2025-05-07T19:46:05.7241101Z 2025-05-07T19:46:05.7241104Z 2025-05-07T19:46:05.7241107Z 2025-05-07T19:46:05.7241249Z  2025-05-07T19:46:05.7241465Z 2025-05-07T19:46:05.7241469Z 2025-05-07T19:46:05.7241473Z 2025-05-07T19:46:05.7241476Z 2025-05-07T19:46:05.7241479Z 2025-05-07T19:46:05.7241483Z 2025-05-07T19:46:05.7241486Z 2025-05-07T19:46:05.7241489Z 2025-05-07T19:46:05.7241493Z 2025-05-07T19:46:05.7241496Z 2025-05-07T19:46:05.7241500Z 2025-05-07T19:46:05.7241503Z 2025-05-07T19:46:05.7241506Z 2025-05-07T19:46:05.7241510Z 2025-05-07T19:46:05.7241513Z 2025-05-07T19:46:05.7241678Z  2025-05-07T19:46:05.7241887Z 2025-05-07T19:46:05.7241891Z 2025-05-07T19:46:05.7241894Z 2025-05-07T19:46:05.7241897Z 2025-05-07T19:46:05.7241901Z 2025-05-07T19:46:05.7241962Z 2025-05-07T19:46:05.7241966Z 2025-05-07T19:46:05.7241969Z 2025-05-07T19:46:05.7241973Z 2025-05-07T19:46:05.7241976Z 2025-05-07T19:46:05.7241980Z 2025-05-07T19:46:05.7241983Z 2025-05-07T19:46:05.7241987Z 2025-05-07T19:46:05.7242005Z 2025-05-07T19:46:05.7242008Z 2025-05-07T19:46:05.7242012Z 2025-05-07T19:46:05.7242164Z  2025-05-07T19:46:05.7242378Z 2025-05-07T19:46:05.7242382Z 2025-05-07T19:46:05.7242385Z 2025-05-07T19:46:05.7242389Z 2025-05-07T19:46:05.7242393Z 2025-05-07T19:46:05.7242396Z 2025-05-07T19:46:05.7242399Z 2025-05-07T19:46:05.7242416Z 2025-05-07T19:46:05.7242419Z 2025-05-07T19:46:05.7242423Z 2025-05-07T19:46:05.7242426Z 2025-05-07T19:46:05.7242429Z 2025-05-07T19:46:05.7242433Z 2025-05-07T19:46:05.7242440Z 2025-05-07T19:46:05.7242444Z 2025-05-07T19:46:05.7242447Z 2025-05-07T19:46:05.7242450Z 2025-05-07T19:46:05.7242606Z  2025-05-07T19:46:05.7242828Z 2025-05-07T19:46:05.7242849Z 2025-05-07T19:46:05.7242853Z 2025-05-07T19:46:05.7242856Z 2025-05-07T19:46:05.7242859Z 2025-05-07T19:46:05.7242862Z 2025-05-07T19:46:05.7242866Z 2025-05-07T19:46:05.7242869Z 2025-05-07T19:46:05.7242873Z 2025-05-07T19:46:05.7242876Z 2025-05-07T19:46:05.7242880Z 2025-05-07T19:46:05.7242883Z 2025-05-07T19:46:05.7242886Z 2025-05-07T19:46:05.7242890Z 2025-05-07T19:46:05.7242893Z 2025-05-07T19:46:05.7242896Z 2025-05-07T19:46:05.7242899Z 2025-05-07T19:46:05.7242903Z 2025-05-07T19:46:05.7243070Z  2025-05-07T19:46:05.7243317Z 2025-05-07T19:46:05.7243321Z 2025-05-07T19:46:05.7243417Z  2025-05-07T19:46:05.7243523Z 2025-05-07T19:46:05.7243527Z 2025-05-07T19:46:05.7243643Z  2025-05-07T19:46:05.7243753Z 2025-05-07T19:46:05.7243757Z 2025-05-07T19:46:05.7243760Z 2025-05-07T19:46:05.7243860Z  2025-05-07T19:46:05.7243988Z 2025-05-07T19:46:05.7243992Z 2025-05-07T19:46:05.7243996Z 2025-05-07T19:46:05.7243999Z 2025-05-07T19:46:05.7244105Z  2025-05-07T19:46:05.7244224Z 2025-05-07T19:46:05.7244228Z 2025-05-07T19:46:05.7244232Z 2025-05-07T19:46:05.7244249Z 2025-05-07T19:46:05.7244252Z 2025-05-07T19:46:05.7244388Z  2025-05-07T19:46:05.7244512Z 2025-05-07T19:46:05.7244516Z 2025-05-07T19:46:05.7244519Z 2025-05-07T19:46:05.7244523Z 2025-05-07T19:46:05.7244526Z 2025-05-07T19:46:05.7244530Z 2025-05-07T19:46:05.7244652Z  2025-05-07T19:46:05.7244782Z 2025-05-07T19:46:05.7244785Z 2025-05-07T19:46:05.7244789Z 2025-05-07T19:46:05.7244792Z 2025-05-07T19:46:05.7244796Z 2025-05-07T19:46:05.7244799Z 2025-05-07T19:46:05.7244802Z 2025-05-07T19:46:05.7245009Z  2025-05-07T19:46:05.7245152Z 2025-05-07T19:46:05.7245156Z 2025-05-07T19:46:05.7245213Z 2025-05-07T19:46:05.7245217Z 2025-05-07T19:46:05.7245220Z 2025-05-07T19:46:05.7245223Z 2025-05-07T19:46:05.7245227Z 2025-05-07T19:46:05.7245230Z 2025-05-07T19:46:05.7245348Z  2025-05-07T19:46:05.7245520Z 2025-05-07T19:46:05.7245524Z 2025-05-07T19:46:05.7245527Z 2025-05-07T19:46:05.7245531Z 2025-05-07T19:46:05.7245534Z 2025-05-07T19:46:05.7245537Z 2025-05-07T19:46:05.7245541Z 2025-05-07T19:46:05.7245544Z 2025-05-07T19:46:05.7245548Z 2025-05-07T19:46:05.7245669Z  2025-05-07T19:46:05.7245844Z 2025-05-07T19:46:05.7245848Z 2025-05-07T19:46:05.7245852Z 2025-05-07T19:46:05.7245855Z 2025-05-07T19:46:05.7245859Z 2025-05-07T19:46:05.7245862Z 2025-05-07T19:46:05.7245866Z 2025-05-07T19:46:05.7245869Z 2025-05-07T19:46:05.7245873Z 2025-05-07T19:46:05.7245876Z 2025-05-07T19:46:05.7246002Z  2025-05-07T19:46:05.7246184Z 2025-05-07T19:46:05.7246188Z 2025-05-07T19:46:05.7246192Z 2025-05-07T19:46:05.7246198Z 2025-05-07T19:46:05.7246202Z 2025-05-07T19:46:05.7246205Z 2025-05-07T19:46:05.7246209Z 2025-05-07T19:46:05.7246212Z 2025-05-07T19:46:05.7246215Z 2025-05-07T19:46:05.7246219Z 2025-05-07T19:46:05.7246222Z 2025-05-07T19:46:05.7246403Z  2025-05-07T19:46:05.7246602Z 2025-05-07T19:46:05.7246605Z 2025-05-07T19:46:05.7246608Z 2025-05-07T19:46:05.7246612Z 2025-05-07T19:46:05.7246615Z 2025-05-07T19:46:05.7246618Z 2025-05-07T19:46:05.7246622Z 2025-05-07T19:46:05.7246625Z 2025-05-07T19:46:05.7246628Z 2025-05-07T19:46:05.7246632Z 2025-05-07T19:46:05.7246635Z 2025-05-07T19:46:05.7246639Z 2025-05-07T19:46:05.7246770Z  2025-05-07T19:46:05.7246975Z 2025-05-07T19:46:05.7246979Z 2025-05-07T19:46:05.7246982Z 2025-05-07T19:46:05.7246986Z 2025-05-07T19:46:05.7246989Z 2025-05-07T19:46:05.7246993Z 2025-05-07T19:46:05.7246996Z 2025-05-07T19:46:05.7246999Z 2025-05-07T19:46:05.7247003Z 2025-05-07T19:46:05.7247006Z 2025-05-07T19:46:05.7247013Z 2025-05-07T19:46:05.7247016Z 2025-05-07T19:46:05.7247020Z 2025-05-07T19:46:05.7247157Z  2025-05-07T19:46:05.7247377Z 2025-05-07T19:46:05.7247381Z 2025-05-07T19:46:05.7247384Z 2025-05-07T19:46:05.7247391Z 2025-05-07T19:46:05.7247395Z 2025-05-07T19:46:05.7247398Z 2025-05-07T19:46:05.7247402Z 2025-05-07T19:46:05.7247405Z 2025-05-07T19:46:05.7247409Z 2025-05-07T19:46:05.7247412Z 2025-05-07T19:46:05.7247416Z 2025-05-07T19:46:05.7247419Z 2025-05-07T19:46:05.7247422Z 2025-05-07T19:46:05.7247425Z 2025-05-07T19:46:05.7247587Z  2025-05-07T19:46:05.7247791Z 2025-05-07T19:46:05.7247795Z 2025-05-07T19:46:05.7247798Z 2025-05-07T19:46:05.7247801Z 2025-05-07T19:46:05.7247805Z 2025-05-07T19:46:05.7247808Z 2025-05-07T19:46:05.7247811Z 2025-05-07T19:46:05.7247815Z 2025-05-07T19:46:05.7247818Z 2025-05-07T19:46:05.7247821Z 2025-05-07T19:46:05.7247825Z 2025-05-07T19:46:05.7247828Z 2025-05-07T19:46:05.7247835Z 2025-05-07T19:46:05.7247838Z 2025-05-07T19:46:05.7247842Z 2025-05-07T19:46:05.7248000Z  2025-05-07T19:46:05.7248209Z 2025-05-07T19:46:05.7248212Z 2025-05-07T19:46:05.7248216Z 2025-05-07T19:46:05.7248222Z 2025-05-07T19:46:05.7248226Z 2025-05-07T19:46:05.7248229Z 2025-05-07T19:46:05.7248232Z 2025-05-07T19:46:05.7248236Z 2025-05-07T19:46:05.7248239Z 2025-05-07T19:46:05.7248242Z 2025-05-07T19:46:05.7248246Z 2025-05-07T19:46:05.7248263Z 2025-05-07T19:46:05.7248266Z 2025-05-07T19:46:05.7248269Z 2025-05-07T19:46:05.7248273Z 2025-05-07T19:46:05.7248276Z 2025-05-07T19:46:05.7248426Z  2025-05-07T19:46:05.7248643Z 2025-05-07T19:46:05.7248646Z 2025-05-07T19:46:05.7248649Z 2025-05-07T19:46:05.7248653Z 2025-05-07T19:46:05.7248656Z 2025-05-07T19:46:05.7248675Z 2025-05-07T19:46:05.7248678Z 2025-05-07T19:46:05.7248681Z 2025-05-07T19:46:05.7248685Z 2025-05-07T19:46:05.7248688Z 2025-05-07T19:46:05.7248742Z 2025-05-07T19:46:05.7248746Z 2025-05-07T19:46:05.7248750Z 2025-05-07T19:46:05.7248753Z 2025-05-07T19:46:05.7248756Z 2025-05-07T19:46:05.7248760Z 2025-05-07T19:46:05.7248763Z 2025-05-07T19:46:05.7248920Z  2025-05-07T19:46:05.7249160Z 2025-05-07T19:46:05.7249164Z 2025-05-07T19:46:05.7249167Z 2025-05-07T19:46:05.7249171Z 2025-05-07T19:46:05.7249174Z 2025-05-07T19:46:05.7249178Z 2025-05-07T19:46:05.7249181Z 2025-05-07T19:46:05.7249184Z 2025-05-07T19:46:05.7249188Z 2025-05-07T19:46:05.7249191Z 2025-05-07T19:46:05.7249194Z 2025-05-07T19:46:05.7249198Z 2025-05-07T19:46:05.7249201Z 2025-05-07T19:46:05.7249205Z 2025-05-07T19:46:05.7249208Z 2025-05-07T19:46:05.7249211Z 2025-05-07T19:46:05.7249215Z 2025-05-07T19:46:05.7249218Z 2025-05-07T19:46:05.7249396Z  2025-05-07T19:46:05.7249621Z 2025-05-07T19:46:05.7249624Z 2025-05-07T19:46:05.7249725Z  2025-05-07T19:46:05.7249831Z 2025-05-07T19:46:05.7249857Z 2025-05-07T19:46:05.7249956Z  2025-05-07T19:46:05.7250066Z 2025-05-07T19:46:05.7250069Z 2025-05-07T19:46:05.7250073Z 2025-05-07T19:46:05.7250173Z  2025-05-07T19:46:05.7250299Z 2025-05-07T19:46:05.7250302Z 2025-05-07T19:46:05.7250357Z 2025-05-07T19:46:05.7250360Z 2025-05-07T19:46:05.7250465Z  2025-05-07T19:46:05.7250586Z 2025-05-07T19:46:05.7250589Z 2025-05-07T19:46:05.7250609Z 2025-05-07T19:46:05.7250612Z 2025-05-07T19:46:05.7250615Z 2025-05-07T19:46:05.7250721Z  2025-05-07T19:46:05.7250849Z 2025-05-07T19:46:05.7250853Z 2025-05-07T19:46:05.7250856Z 2025-05-07T19:46:05.7250859Z 2025-05-07T19:46:05.7250863Z 2025-05-07T19:46:05.7250866Z 2025-05-07T19:46:05.7250992Z  2025-05-07T19:46:05.7251124Z 2025-05-07T19:46:05.7251127Z 2025-05-07T19:46:05.7251131Z 2025-05-07T19:46:05.7251134Z 2025-05-07T19:46:05.7251137Z 2025-05-07T19:46:05.7251141Z 2025-05-07T19:46:05.7251144Z 2025-05-07T19:46:05.7251279Z  2025-05-07T19:46:05.7251423Z 2025-05-07T19:46:05.7251427Z 2025-05-07T19:46:05.7251430Z 2025-05-07T19:46:05.7251433Z 2025-05-07T19:46:05.7251437Z 2025-05-07T19:46:05.7251440Z 2025-05-07T19:46:05.7251443Z 2025-05-07T19:46:05.7251447Z 2025-05-07T19:46:05.7251569Z  2025-05-07T19:46:05.7251739Z 2025-05-07T19:46:05.7251742Z 2025-05-07T19:46:05.7251746Z 2025-05-07T19:46:05.7251749Z 2025-05-07T19:46:05.7251752Z 2025-05-07T19:46:05.7251756Z 2025-05-07T19:46:05.7251759Z 2025-05-07T19:46:05.7251763Z 2025-05-07T19:46:05.7251766Z 2025-05-07T19:46:05.7251890Z  2025-05-07T19:46:05.7252066Z 2025-05-07T19:46:05.7252069Z 2025-05-07T19:46:05.7252073Z 2025-05-07T19:46:05.7252076Z 2025-05-07T19:46:05.7252080Z 2025-05-07T19:46:05.7252083Z 2025-05-07T19:46:05.7252087Z 2025-05-07T19:46:05.7252090Z 2025-05-07T19:46:05.7252093Z 2025-05-07T19:46:05.7252096Z 2025-05-07T19:46:05.7252222Z  2025-05-07T19:46:05.7252422Z 2025-05-07T19:46:05.7252429Z 2025-05-07T19:46:05.7252433Z 2025-05-07T19:46:05.7252436Z 2025-05-07T19:46:05.7252439Z 2025-05-07T19:46:05.7252443Z 2025-05-07T19:46:05.7252446Z 2025-05-07T19:46:05.7252449Z 2025-05-07T19:46:05.7252453Z 2025-05-07T19:46:05.7252459Z 2025-05-07T19:46:05.7252463Z 2025-05-07T19:46:05.7252593Z  2025-05-07T19:46:05.7252789Z 2025-05-07T19:46:05.7252792Z 2025-05-07T19:46:05.7252796Z 2025-05-07T19:46:05.7252799Z 2025-05-07T19:46:05.7252803Z 2025-05-07T19:46:05.7252806Z 2025-05-07T19:46:05.7252809Z 2025-05-07T19:46:05.7252813Z 2025-05-07T19:46:05.7252816Z 2025-05-07T19:46:05.7252820Z 2025-05-07T19:46:05.7252824Z 2025-05-07T19:46:05.7252827Z 2025-05-07T19:46:05.7252958Z  2025-05-07T19:46:05.7253163Z 2025-05-07T19:46:05.7253166Z 2025-05-07T19:46:05.7253170Z 2025-05-07T19:46:05.7253173Z 2025-05-07T19:46:05.7253176Z 2025-05-07T19:46:05.7253180Z 2025-05-07T19:46:05.7253183Z 2025-05-07T19:46:05.7253186Z 2025-05-07T19:46:05.7253253Z 2025-05-07T19:46:05.7253257Z 2025-05-07T19:46:05.7253261Z 2025-05-07T19:46:05.7253264Z 2025-05-07T19:46:05.7253267Z 2025-05-07T19:46:05.7253407Z  2025-05-07T19:46:05.7253621Z 2025-05-07T19:46:05.7253628Z 2025-05-07T19:46:05.7253632Z 2025-05-07T19:46:05.7253636Z 2025-05-07T19:46:05.7253640Z 2025-05-07T19:46:05.7253643Z 2025-05-07T19:46:05.7253647Z 2025-05-07T19:46:05.7253650Z 2025-05-07T19:46:05.7253653Z 2025-05-07T19:46:05.7253657Z 2025-05-07T19:46:05.7253660Z 2025-05-07T19:46:05.7253663Z 2025-05-07T19:46:05.7253667Z 2025-05-07T19:46:05.7253670Z 2025-05-07T19:46:05.7253827Z  2025-05-07T19:46:05.7254031Z 2025-05-07T19:46:05.7254034Z 2025-05-07T19:46:05.7254037Z 2025-05-07T19:46:05.7254041Z 2025-05-07T19:46:05.7254044Z 2025-05-07T19:46:05.7254047Z 2025-05-07T19:46:05.7254051Z 2025-05-07T19:46:05.7254054Z 2025-05-07T19:46:05.7254057Z 2025-05-07T19:46:05.7254061Z 2025-05-07T19:46:05.7254067Z 2025-05-07T19:46:05.7254070Z 2025-05-07T19:46:05.7254074Z 2025-05-07T19:46:05.7254077Z 2025-05-07T19:46:05.7254081Z 2025-05-07T19:46:05.7254242Z  2025-05-07T19:46:05.7254452Z 2025-05-07T19:46:05.7254509Z 2025-05-07T19:46:05.7254512Z 2025-05-07T19:46:05.7254516Z 2025-05-07T19:46:05.7254519Z 2025-05-07T19:46:05.7254522Z 2025-05-07T19:46:05.7254526Z 2025-05-07T19:46:05.7254529Z 2025-05-07T19:46:05.7254533Z 2025-05-07T19:46:05.7254536Z 2025-05-07T19:46:05.7254540Z 2025-05-07T19:46:05.7254557Z 2025-05-07T19:46:05.7254561Z 2025-05-07T19:46:05.7254564Z 2025-05-07T19:46:05.7254567Z 2025-05-07T19:46:05.7254571Z 2025-05-07T19:46:05.7254725Z  2025-05-07T19:46:05.7254940Z 2025-05-07T19:46:05.7254943Z 2025-05-07T19:46:05.7254947Z 2025-05-07T19:46:05.7254950Z 2025-05-07T19:46:05.7254954Z 2025-05-07T19:46:05.7254974Z 2025-05-07T19:46:05.7254977Z 2025-05-07T19:46:05.7254981Z 2025-05-07T19:46:05.7254987Z 2025-05-07T19:46:05.7254991Z 2025-05-07T19:46:05.7254994Z 2025-05-07T19:46:05.7254997Z 2025-05-07T19:46:05.7255001Z 2025-05-07T19:46:05.7255004Z 2025-05-07T19:46:05.7255007Z 2025-05-07T19:46:05.7255011Z 2025-05-07T19:46:05.7255018Z 2025-05-07T19:46:05.7255174Z  2025-05-07T19:46:05.7255411Z 2025-05-07T19:46:05.7255414Z 2025-05-07T19:46:05.7255418Z 2025-05-07T19:46:05.7255421Z 2025-05-07T19:46:05.7255424Z 2025-05-07T19:46:05.7255428Z 2025-05-07T19:46:05.7255431Z 2025-05-07T19:46:05.7255434Z 2025-05-07T19:46:05.7255438Z 2025-05-07T19:46:05.7255441Z 2025-05-07T19:46:05.7255445Z 2025-05-07T19:46:05.7255448Z 2025-05-07T19:46:05.7255452Z 2025-05-07T19:46:05.7255455Z 2025-05-07T19:46:05.7255458Z 2025-05-07T19:46:05.7255462Z 2025-05-07T19:46:05.7255465Z 2025-05-07T19:46:05.7255468Z 2025-05-07T19:46:05.7255650Z  2025-05-07T19:46:05.7255874Z 2025-05-07T19:46:05.7255877Z 2025-05-07T19:46:05.7255978Z  2025-05-07T19:46:05.7256086Z 2025-05-07T19:46:05.7256103Z 2025-05-07T19:46:05.7256200Z  2025-05-07T19:46:05.7256308Z 2025-05-07T19:46:05.7256312Z 2025-05-07T19:46:05.7256315Z 2025-05-07T19:46:05.7256446Z  2025-05-07T19:46:05.7256560Z 2025-05-07T19:46:05.7256564Z 2025-05-07T19:46:05.7256567Z 2025-05-07T19:46:05.7256570Z 2025-05-07T19:46:05.7256672Z  2025-05-07T19:46:05.7256805Z 2025-05-07T19:46:05.7256809Z 2025-05-07T19:46:05.7256812Z 2025-05-07T19:46:05.7256815Z 2025-05-07T19:46:05.7256819Z 2025-05-07T19:46:05.7256923Z  2025-05-07T19:46:05.7257047Z 2025-05-07T19:46:05.7257050Z 2025-05-07T19:46:05.7257053Z 2025-05-07T19:46:05.7257057Z 2025-05-07T19:46:05.7257076Z 2025-05-07T19:46:05.7257080Z 2025-05-07T19:46:05.7257188Z  2025-05-07T19:46:05.7257317Z 2025-05-07T19:46:05.7257321Z 2025-05-07T19:46:05.7257324Z 2025-05-07T19:46:05.7257328Z 2025-05-07T19:46:05.7257331Z 2025-05-07T19:46:05.7257334Z 2025-05-07T19:46:05.7257402Z 2025-05-07T19:46:05.7257530Z  2025-05-07T19:46:05.7257670Z 2025-05-07T19:46:05.7257675Z 2025-05-07T19:46:05.7257678Z 2025-05-07T19:46:05.7257681Z 2025-05-07T19:46:05.7257684Z 2025-05-07T19:46:05.7257692Z 2025-05-07T19:46:05.7257695Z 2025-05-07T19:46:05.7257699Z 2025-05-07T19:46:05.7257840Z  done 2025-05-07T19:46:05.9364111Z Preparing transaction: / - done 2025-05-07T19:46:06.6386159Z Verifying transaction: | / - \ | / - done 2025-05-07T19:46:06.9437714Z Executing transaction: | / - done 2025-05-07T19:46:08.8949709Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:08.8950224Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:08.8951058Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:08.8951700Z 2025-05-07T19:46:08.8972002Z 2025-05-07T19:46:08.8972957Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:08.8973805Z 2025-05-07T19:46:08.8988336Z 2025-05-07T19:46:08.8989084Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:08.9007587Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:08.9011805Z 2025-05-07T19:46:08.9209026Z 2025-05-07T19:46:08.9215464Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:08.9219794Z 2025-05-07T19:46:08.9231608Z 2025-05-07T19:46:08.9232316Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:08.9628256Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:10.7433545Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:46:10.8025611Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:10.8027364Z 2025-05-07T19:46:11.2070828Z 2025-05-07T19:46:11.2073062Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:11.2442475Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:11.2443073Z 2025-05-07T19:46:11.6490117Z 2025-05-07T19:46:11.6491060Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:46:11.6494053Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:11.6495442Z 2025-05-07T19:46:12.0703157Z 2025-05-07T19:46:14.0054953Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:15.9427425Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:17.8862850Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:17.8863803Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:19.8167714Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:21.6111233Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:21.6111538Z 2025-05-07T19:46:21.6841830Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:25.3523963Z /tmp/tmp2ohepqtt: line 3: clang: command not found 2025-05-07T19:46:25.3524380Z 2025-05-07T19:46:25.3524792Z ERROR conda.cli.main_run:execute(125): `conda run clang --version` failed. (See above for error) 2025-05-07T19:46:25.4148427Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:25.4148833Z 2025-05-07T19:46:25.4174386Z total 56 2025-05-07T19:46:25.4174756Z drwxr-xr-x. 2 root root 16384 May 7 19:46 . 2025-05-07T19:46:25.4175293Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:25.4175812Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:25.4176350Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:46:25.4176851Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:46:25.4177344Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:25.4177802Z -rw-r--r--. 2 root root 872 May 7 16:10 libxml2_activate.sh 2025-05-07T19:46:25.4178229Z -rw-r--r--. 2 root root 499 Mar 28 22:35 openjdk_activate.sh 2025-05-07T19:46:25.4178686Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:25.4178970Z 2025-05-07T19:46:25.4179219Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:25.4179923Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:25.4180405Z 2025-05-07T19:46:25.4192802Z 2025-05-07T19:46:25.4193442Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:25.4194255Z 2025-05-07T19:46:27.2582485Z 2025-05-07T19:46:27.2583231Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:27.2584948Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler" 2025-05-07T19:46:27.6826811Z 2025-05-07T19:46:27.6826840Z 2025-05-07T19:46:27.6827733Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:27.6828597Z 2025-05-07T19:46:29.4773719Z -allow-unsupported-compiler 2025-05-07T19:46:29.4774386Z 2025-05-07T19:46:29.5343837Z 2025-05-07T19:46:29.5344681Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:46:29.5345525Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:29.5345881Z 2025-05-07T19:46:31.4082089Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:31.4082797Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:31.4083469Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:31.4083902Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:31.4084345Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:31.4084795Z #define _STL_PAIR_H 1 2025-05-07T19:46:31.4085124Z #define __cpp_attributes 200809L 2025-05-07T19:46:31.4085585Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:31.4086021Z #define __DELETE_THROW throw() 2025-05-07T19:46:31.4086320Z #define _PTRDIFF_T_ 2025-05-07T19:46:31.4086622Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:31.4087006Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:46:31.4087309Z #define _IO_LEFT 02 2025-05-07T19:46:31.4087600Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:31.4087900Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:31.4088237Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:31.4088734Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:31.4089237Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:46:31.4089556Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:31.4089874Z #define _IOS_OUTPUT 2 2025-05-07T19:46:31.4090225Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:46:31.4090633Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:31.4091174Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:46:31.4091478Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:31.4091818Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:31.4092718Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:31.4093672Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:31.4094047Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:46:31.4094384Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:31.4094764Z #define _T_WCHAR_ 2025-05-07T19:46:31.4095015Z #define stdout stdout 2025-05-07T19:46:31.4095429Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:31.4095865Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:31.4096181Z #define __flexarr [] 2025-05-07T19:46:31.4096453Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:31.4096855Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:31.4097273Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:31.4097560Z #define _MATH_H 1 2025-05-07T19:46:31.4097896Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:31.4098272Z #define __S64_TYPE long int 2025-05-07T19:46:31.4098585Z #define __stub_fchflags 2025-05-07T19:46:31.4098882Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:31.4099307Z #define __SQUAD_TYPE long int 2025-05-07T19:46:31.4099606Z #define __INTMAX_C(c) c ## L 2025-05-07T19:46:31.4099935Z #define _BSD_SIZE_T_DEFINED_ 2025-05-07T19:46:31.4100228Z #define NL_NMAX INT_MAX 2025-05-07T19:46:31.4100526Z #define _BITS_TIME_H 1 2025-05-07T19:46:31.4100869Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:31.4101234Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:31.4101607Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:31.4101999Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:31.4102468Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:31.4102881Z #define __CHAR_BIT__ 8 2025-05-07T19:46:31.4103212Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4103573Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:31.4103930Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:31.4104264Z #define FP_NAN 0 2025-05-07T19:46:31.4104562Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:31.4105084Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:46:31.4105633Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:31.4106085Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:31.4106514Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:31.4106851Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:31.4107129Z #define _NEW 2025-05-07T19:46:31.4107416Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:46:31.4107770Z #define __UINT8_MAX__ 0xff 2025-05-07T19:46:31.4108198Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:31.4108692Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:46:31.4108966Z #define __USE_ANSI 1 2025-05-07T19:46:31.4109379Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:31.4109827Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:31.4110311Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:31.4110650Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:46:31.4110991Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:31.4111337Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:46:31.4112072Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:31.4112426Z #define PIPE_BUF 4096 2025-05-07T19:46:31.4112792Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:31.4113234Z #define ADJ_TICK 0x4000 2025-05-07T19:46:31.4113548Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:31.4113953Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:31.4114326Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:31.4114721Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:31.4115280Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:31.4115870Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:31.4116313Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:31.4116605Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:31.4116950Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4117279Z #define __cpp_static_assert 201411L 2025-05-07T19:46:31.4117697Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:31.4118087Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:31.4118435Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:31.4118775Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:31.4119123Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:31.4119466Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:31.4119809Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4120239Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:31.4120624Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:46:31.4120958Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:31.4121305Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4121726Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:31.4122119Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:31.4122473Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:31.4122829Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:31.4123190Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:31.4123571Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:31.4124021Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:46:31.4147776Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:31.4148280Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:31.4148661Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:31.4148996Z #define __GCC_IEC_559 2 2025-05-07T19:46:31.4149469Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:31.4149865Z #define _IO_flockfile(_fp) 2025-05-07T19:46:31.4150204Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:31.4150532Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:46:31.4150872Z #define _IOFBF 0 2025-05-07T19:46:31.4151119Z #define __USE_BSD 1 2025-05-07T19:46:31.4151412Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:46:31.4151754Z #define SHRT_MIN (-SHRT_MAX - 1) 2025-05-07T19:46:31.4152063Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:31.4152385Z #define _IO_NO_WRITES 8 2025-05-07T19:46:31.4152672Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:31.4153219Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:31.4153623Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:31.4154001Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:31.4154367Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:31.4154737Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:31.4155051Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:31.4155411Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:46:31.4155793Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:31.4156230Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:31.4156673Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:31.4157011Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:31.4157386Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:31.4157753Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:31.4158128Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:31.4158477Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:31.4158815Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:31.4159139Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:31.4159801Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:31.4160597Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:31.4160962Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:31.4161334Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:31.4161704Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:31.4162038Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:31.4162333Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:31.4162696Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:31.4163059Z #define __cpp_variadic_templates 200704L 2025-05-07T19:46:31.4163419Z #define RAND_MAX 2147483647 2025-05-07T19:46:31.4163718Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:46:31.4164117Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4164471Z #define __SM_90_RT_H__ 2025-05-07T19:46:31.4164774Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:46:31.4165096Z #define __COMPAR_FN_T 2025-05-07T19:46:31.4165372Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:31.4165692Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:31.4166222Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:31.4166835Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:31.4167213Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:31.4167650Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:31.4167979Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:31.4168382Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:31.4168764Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:31.4169328Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:31.4169966Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:31.4170341Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:31.4170674Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:31.4171005Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:31.4171373Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:31.4171671Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:46:31.4171998Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:31.4172282Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:31.4172579Z #define __u_char_defined 2025-05-07T19:46:31.4172948Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:31.4173347Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:31.4173653Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:31.4173931Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:46:31.4174263Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:31.4174748Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:31.4175318Z #define FP_INFINITE 1 2025-05-07T19:46:31.4175725Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:31.4176225Z #define _IO_pid_t __pid_t 2025-05-07T19:46:31.4176540Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:46:31.4176843Z #define __LEAF , __leaf__ 2025-05-07T19:46:31.4177139Z #define PATH_MAX 4096 2025-05-07T19:46:31.4177419Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:46:31.4177812Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:31.4178171Z #define _LIMITS_H___ 2025-05-07T19:46:31.4178450Z #define __size_t 2025-05-07T19:46:31.4178698Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:31.4179333Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:31.4179998Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:31.4180338Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:31.4180733Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:46:31.4181023Z #define _WCHAR_T_DEFINED 2025-05-07T19:46:31.4181446Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:31.4181891Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:31.4182259Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:31.4182685Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:31.4183032Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:31.4183345Z #define __INT8_C(c) c 2025-05-07T19:46:31.4183656Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:31.4184020Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:31.4184324Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:31.4185365Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:31.4185649Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:31.4185990Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:31.4186359Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4186756Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:46:31.4187065Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:31.4187405Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:31.4187702Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:31.4188082Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:31.4188452Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:31.4188863Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:31.4189409Z #define NFDBITS __NFDBITS 2025-05-07T19:46:31.4189714Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:31.4190066Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:31.4190428Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:31.4190824Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:46:31.4191119Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:31.4191470Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:31.4191834Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:31.4192173Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:31.4192662Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:31.4193060Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:46:31.4193398Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:31.4193745Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:31.4194178Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:31.4194552Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:31.4194928Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:31.4195325Z #define __daddr_t_defined 2025-05-07T19:46:31.4195606Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:46:31.4195932Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:31.4196279Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:31.4196875Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:31.4197421Z #define _ACRTIMP 2025-05-07T19:46:31.4197691Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:31.4198117Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:31.4198465Z #define _IOS_BIN 128 2025-05-07T19:46:31.4198885Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:31.4199350Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4199682Z #define UNDERFLOW 4 2025-05-07T19:46:31.4199930Z #define NAME_MAX 255 2025-05-07T19:46:31.4200230Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:31.4200529Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:46:31.4200876Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:31.4201310Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:31.4201729Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:46:31.4202142Z #define __ptr_t void * 2025-05-07T19:46:31.4202423Z #define M_E 2.7182818284590452354 2025-05-07T19:46:31.4202754Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:31.4203043Z #define __USE_ISOCXX11 1 2025-05-07T19:46:31.4203352Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:31.4203685Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:31.4204012Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:31.4204298Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:31.4204626Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:31.4205050Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:31.4205359Z #define __linux 1 2025-05-07T19:46:31.4205611Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:46:31.4205936Z #define cudaDeviceMask 0xff 2025-05-07T19:46:31.4206248Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:31.4206559Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:31.4206885Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:31.4207189Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:31.4207535Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:46:31.4207848Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:31.4208183Z #define _BITS_TYPES_H 1 2025-05-07T19:46:31.4208476Z #define ULONG_LONG_MAX (LONG_LONG_MAX * 2ULL + 1ULL) 2025-05-07T19:46:31.4208856Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:31.4209192Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:31.4209473Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:31.4209798Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:31.4210095Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:31.4210940Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:31.4211796Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:31.4212114Z #define __unix 1 2025-05-07T19:46:31.4212339Z #define MATH_ERRNO 1 2025-05-07T19:46:31.4212617Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:31.4212936Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:31.4213215Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:46:31.4213534Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:31.4213827Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:31.4214156Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:31.4214630Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:31.4215141Z #define __nv_pure__ __location__(nv_pure) 2025-05-07T19:46:31.4215452Z #define CUDARTAPI_CDECL 2025-05-07T19:46:31.4215741Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:31.4216048Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:31.4216341Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:31.4216635Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:31.4216877Z #define __SIZE_T 2025-05-07T19:46:31.4217162Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:31.4217498Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:31.4217832Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:31.4218100Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:31.4218394Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:31.4218793Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:31.4219339Z #define __WAIT_STATUS void * 2025-05-07T19:46:31.4219612Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:31.4219916Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:31.4220194Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:46:31.4220528Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:31.4220845Z #define __WINT_MIN__ 0U 2025-05-07T19:46:31.4221456Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:31.4222168Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:31.4222485Z #define WUNTRACED 2 2025-05-07T19:46:31.4222770Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:31.4223059Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:31.4223391Z #define NZERO 20 2025-05-07T19:46:31.4223631Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:31.4223949Z #define _PSTL_PRAGMA(x) _Pragma(#x) 2025-05-07T19:46:31.4224288Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:31.4224593Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:31.4224881Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:46:31.4225178Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:46:31.4225493Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:31.4225835Z #define SCHAR_MIN (-SCHAR_MAX - 1) 2025-05-07T19:46:31.4226146Z #define EXIT_FAILURE 1 2025-05-07T19:46:31.4226397Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:31.4226704Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:31.4226990Z #define _SIZE_T_DEFINED_ 2025-05-07T19:46:31.4227299Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:31.4227617Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:31.4227963Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:31.4228360Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:31.4228656Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:46:31.4228944Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:46:31.4229222Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:31.4229642Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:31.4230142Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:46:31.4230500Z #define SEEK_DATA 3 2025-05-07T19:46:31.4230870Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:31.4231197Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:31.4231706Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:31.4232145Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:31.4232446Z #define __INT64_C(c) c ## L 2025-05-07T19:46:31.4232743Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:31.4233136Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:31.4233495Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:31.4233825Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:31.4234147Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:31.4234503Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:31.4234817Z #define __INT_WCHAR_T_H 2025-05-07T19:46:31.4235083Z #define WSTOPPED 2 2025-05-07T19:46:31.4235372Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:31.4235687Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:31.4235990Z #define FP_NORMAL 4 2025-05-07T19:46:31.4236257Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:31.4236599Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:31.4236864Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:31.4237180Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:31.4237495Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:31.4237820Z #define cudaTextureType1D 0x01 2025-05-07T19:46:31.4238145Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:31.4238439Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:31.4238770Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:31.4239103Z #define __toascii(c) ((c) & 0x7f) 2025-05-07T19:46:31.4239608Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:31.4240117Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:31.4240443Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:31.4244302Z #define _POSIX_SOURCE 1 2025-05-07T19:46:31.4244706Z #define cudaTextureType2D 0x02 2025-05-07T19:46:31.4244987Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:31.4245305Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:31.4245680Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:31.4245968Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:31.4246333Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:31.4246688Z #define cudaTextureType3D 0x03 2025-05-07T19:46:31.4246997Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:31.4247275Z #define CLOCK_REALTIME 0 2025-05-07T19:46:31.4247566Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:46:31.4247855Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:31.4248201Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:31.4248520Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:31.4248809Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:31.4249136Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:31.4249426Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:31.4249775Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:46:31.4250081Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:31.4250395Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:46:31.4250652Z #define __GLIBC__ 2 2025-05-07T19:46:31.4251001Z #define __END_DECLS } 2025-05-07T19:46:31.4251252Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:31.4251664Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:46:31.4252094Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:31.4252359Z #define WCONTINUED 8 2025-05-07T19:46:31.4252639Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:31.4252910Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:31.4253217Z #define _ALLOCA_H 1 2025-05-07T19:46:31.4253457Z #define __host__ __location__(host) 2025-05-07T19:46:31.4253918Z #define __warndecl(name,msg) extern void name (void) __attribute__((__warning__ (msg))) 2025-05-07T19:46:31.4254377Z #define __SLONG32_TYPE int 2025-05-07T19:46:31.4254679Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:31.4254972Z #define _SYS_SELECT_H 1 2025-05-07T19:46:31.4255242Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:31.4255523Z #define _IOS_NOCREATE 32 2025-05-07T19:46:31.4255779Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:46:31.4256094Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:31.4256401Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:31.4256722Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:31.4257022Z #define __global__ __location__(global) 2025-05-07T19:46:31.4257346Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:31.4257613Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:31.4257928Z #define __DBL_DIG__ 15 2025-05-07T19:46:31.4258168Z #define TIME_UTC 1 2025-05-07T19:46:31.4258429Z #define __FLT32_DIG__ 6 2025-05-07T19:46:31.4258799Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:31.4259219Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:31.4259594Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:31.4259917Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:31.4260262Z #define _G_BUFSIZ 8192 2025-05-07T19:46:31.4260574Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:46:31.4260989Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:31.4261297Z #define __cudaCDP2GetDevice 2025-05-07T19:46:31.4261622Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:31.4261948Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:31.4262208Z #define __GXX_WEAK__ 1 2025-05-07T19:46:31.4262505Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:31.4262823Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:31.4263123Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:31.4263607Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:46:31.4264003Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:31.4264309Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:31.4264646Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:31.4265045Z #define _G_config_h 1 2025-05-07T19:46:31.4265380Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:31.4265789Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:31.4266096Z #define _GCC_WCHAR_T 2025-05-07T19:46:31.4266391Z #define TMP_MAX 238328 2025-05-07T19:46:31.4266656Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:46:31.4266973Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:31.4267254Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:31.4267573Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:31.4267874Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:31.4268203Z #define _IO_SKIPWS 01 2025-05-07T19:46:31.4268636Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:31.4269163Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:31.4269559Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:31.4270083Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:46:31.4270521Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:31.4270933Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:31.4271370Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:46:31.4271650Z #define le32toh(x) (x) 2025-05-07T19:46:31.4271937Z #define _SIZE_T_DEFINED 2025-05-07T19:46:31.4272279Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:31.4272676Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:31.4273089Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:46:31.4273528Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:31.4274022Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:31.4274317Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:31.4274647Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:31.4274941Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:31.4275283Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:31.4275879Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:31.4276483Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:31.4276859Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:31.4277257Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:31.4277645Z #define _WCHAR_T_ 2025-05-07T19:46:31.4277941Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:31.4278330Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:46:31.4278772Z #define RTSIG_MAX 32 2025-05-07T19:46:31.4279021Z #define _STDDEF_H 2025-05-07T19:46:31.4279289Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:31.4279596Z #define _VA_LIST_DEFINED 2025-05-07T19:46:31.4279884Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:46:31.4280249Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:31.4280691Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:31.4281074Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:46:31.4281382Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:31.4281924Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:31.4282592Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:31.4283006Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:31.4283347Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:31.4283700Z #define __unix__ 1 2025-05-07T19:46:31.4283941Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:31.4284247Z #define __INT_WIDTH__ 32 2025-05-07T19:46:31.4284636Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:31.4285051Z #define _IONBF 2 2025-05-07T19:46:31.4285554Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:31.4286408Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? __uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:31.4287025Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:31.4287395Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:31.4287694Z #define __UINT16_C(c) c 2025-05-07T19:46:31.4287951Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:31.4288227Z #define STA_DEL 0x0020 2025-05-07T19:46:31.4288504Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:31.4288767Z #define __id_t_defined 2025-05-07T19:46:31.4289077Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:31.4289551Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:31.4290022Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:31.4290300Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:31.4290587Z #define __DECIMAL_DIG__ 21 2025-05-07T19:46:31.4290866Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:31.4291161Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:31.4291463Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:31.4291746Z #define SING 2 2025-05-07T19:46:31.4291994Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:31.4292286Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4292610Z #define cudaStreamDefault 0x00 2025-05-07T19:46:31.4292970Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:46:31.4293395Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:31.4293675Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:31.4294059Z #define __gnu_linux__ 1 2025-05-07T19:46:31.4294302Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:46:31.4294582Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:31.4294870Z #define MAX_INPUT 255 2025-05-07T19:46:31.4295124Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:46:31.4295512Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:31.4295924Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:31.4296300Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:31.4296582Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:31.4297043Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:31.4297503Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:31.4297886Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:31.4298397Z #define _Mfloat_ float 2025-05-07T19:46:31.4298752Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:31.4299075Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:46:31.4299364Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:31.4299864Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:31.4300358Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4300633Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:31.4300945Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:31.4301319Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:31.4301632Z #define __USE_ISOC11 1 2025-05-07T19:46:31.4301855Z #define _BSD_SIZE_T_ 2025-05-07T19:46:31.4302098Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:31.4302344Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:31.4302609Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:31.4302887Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:31.4303219Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:46:31.4303506Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:31.4303846Z #define __THROW throw () 2025-05-07T19:46:31.4304089Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:31.4304381Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4304733Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:31.4305071Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:31.4305361Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:31.4305626Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:46:31.4305909Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:31.4306167Z #define L_tmpnam 20 2025-05-07T19:46:31.4306383Z #define ___int_wchar_t_h 2025-05-07T19:46:31.4306710Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:31.4307100Z #define isascii(c) __isascii (c) 2025-05-07T19:46:31.4307424Z #define _T_PTRDIFF 2025-05-07T19:46:31.4307728Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:31.4308112Z #define toascii(c) __toascii (c) 2025-05-07T19:46:31.4308370Z #define __GNUC__ 11 2025-05-07T19:46:31.4308652Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:31.4308954Z #define __GXX_RTTI 1 2025-05-07T19:46:31.4309196Z #define __pie__ 2 2025-05-07T19:46:31.4309485Z #define __MMX__ 1 2025-05-07T19:46:31.4309900Z #define __cudaCDP2Malloc 2025-05-07T19:46:31.4310160Z #define __timespec_defined 1 2025-05-07T19:46:31.4310450Z #define L_ctermid 9 2025-05-07T19:46:31.4310688Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:31.4311047Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:31.4311486Z #define offsetof(TYPE,MEMBER) __builtin_offsetof (TYPE, MEMBER) 2025-05-07T19:46:31.4311872Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:31.4312179Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:31.4312503Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:31.4312865Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:31.4313214Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:31.4313533Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:31.4314004Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:31.4314929Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:31.4315631Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:31.4315966Z #define __USE_SVID 1 2025-05-07T19:46:31.4316269Z #define __constant__ __location__(constant) 2025-05-07T19:46:31.4316614Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:31.4316968Z #define __device__ __location__(device) 2025-05-07T19:46:31.4317317Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:31.4317712Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:31.4317993Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:31.4318331Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:31.4318735Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:31.4319136Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:31.4319476Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:31.4319852Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:31.4320281Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:31.4320540Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:31.4320961Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:31.4321422Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:31.4321784Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:31.4322193Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:46:31.4322455Z #define NGROUPS_MAX 65536 2025-05-07T19:46:31.4322725Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:31.4322988Z #define __USE_ISOC95 1 2025-05-07T19:46:31.4323236Z #define _TIME_H 1 2025-05-07T19:46:31.4323500Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:31.4323859Z #define __USE_ISOC99 1 2025-05-07T19:46:31.4324195Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:31.4324609Z #define HOST_NAME_MAX 64 2025-05-07T19:46:31.4324875Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:31.4325181Z #define _IOS_ATEND 4 2025-05-07T19:46:31.4325457Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:31.4325796Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:31.4326248Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:31.4326609Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:46:31.4326935Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:31.4327270Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:31.4327624Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:46:31.4327892Z #define _STDIO_H 1 2025-05-07T19:46:31.4328404Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:31.4328933Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:31.4329311Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:31.4329744Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:31.4330064Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:31.4330381Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:31.4330676Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:31.4331018Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:31.4331340Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4331701Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:31.4332018Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:46:31.4332311Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:31.4332654Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:31.4332934Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:31.4333255Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:31.4333628Z #define _ISbit(bit) ((bit) < 8 ? ((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:31.4334033Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:31.4334285Z #define __USE_XOPEN 1 2025-05-07T19:46:31.4334562Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:31.4335066Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:31.4335548Z #define __USE_XOPEN2K 1 2025-05-07T19:46:31.4335834Z #define _PSTL_UDR_PRESENT 1 2025-05-07T19:46:31.4336114Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:46:31.4336452Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:31.4336733Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:31.4337290Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:31.4337832Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:31.4338151Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:46:31.4338516Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:31.4338947Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:31.4339355Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:31.4339756Z #define __END_NAMESPACE_C99 2025-05-07T19:46:31.4340063Z #define __glibcxx_integral_traps true 2025-05-07T19:46:31.4340364Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:31.4340654Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:31.4340921Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:46:31.4341226Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:31.4341487Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:31.4341816Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:46:31.4342136Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:31.4342542Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:31.4342969Z #define LONG_MIN (-LONG_MAX - 1L) 2025-05-07T19:46:31.4343260Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:31.4343582Z #define _IO_UNITBUF 020000 2025-05-07T19:46:31.4343852Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:31.4344170Z #define __FD_SETSIZE 1024 2025-05-07T19:46:31.4344435Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:31.4344745Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:31.4345100Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:31.4345500Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:46:31.4345810Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:46:31.4346134Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:31.4346499Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:31.4346788Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:46:31.4347151Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:31.4347507Z #define _WCHAR_T_DEFINED_ 2025-05-07T19:46:31.4347847Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:31.4348192Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:31.4348530Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:46:31.4348829Z #define __USE_POSIX199506 1 2025-05-07T19:46:31.4349133Z #define _FEATURES_H 1 2025-05-07T19:46:31.4349561Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:31.4350163Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:31.4350677Z #define __stub_getmsg 2025-05-07T19:46:31.4350956Z #define _IO_FIXED 010000 2025-05-07T19:46:31.4351300Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:31.4351663Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:31.4352015Z #define __stub_setlogin 2025-05-07T19:46:31.4352297Z #define __stub_fattach 2025-05-07T19:46:31.4352608Z #define __cplusplus 201703L 2025-05-07T19:46:31.4352918Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:31.4353263Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:31.4353592Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:31.4353913Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:31.4354494Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:31.4355094Z #define _IO_INTERNAL 010 2025-05-07T19:46:31.4355395Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:46:31.4355764Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:31.4356182Z #define __dev_t_defined 2025-05-07T19:46:31.4356459Z #define __DEPRECATED 1 2025-05-07T19:46:31.4356748Z #define __S32_TYPE int 2025-05-07T19:46:31.4357118Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:31.4357448Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:31.4357771Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:31.4358056Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:31.4358745Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:31.4359450Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:31.4359822Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:31.4360213Z #define OVERFLOW 3 2025-05-07T19:46:31.4360522Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:31.4360900Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:46:31.4361226Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:31.4361642Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:31.4362126Z #define __SSE2_MATH__ 1 2025-05-07T19:46:31.4362431Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:46:31.4362762Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:31.4363110Z #define _IO_STDIO_H 2025-05-07T19:46:31.4363374Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:31.4363718Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:31.4364057Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:31.4364399Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4364764Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:31.4365051Z #define __amd64 1 2025-05-07T19:46:31.4365323Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:31.4365609Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:31.4365929Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:31.4366234Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:31.4366597Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:31.4366879Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:31.4367227Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:31.4367502Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:31.4367796Z #define __bounded 2025-05-07T19:46:31.4368073Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:31.4368372Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:31.4368702Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:31.4368979Z #define _PTRDIFF_T_DECLARED 2025-05-07T19:46:31.4369290Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4369618Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:31.4370046Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:31.4370457Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:31.4370746Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:31.4371124Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:31.4371483Z #define STA_PLL 0x0001 2025-05-07T19:46:31.4371826Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:46:31.4372109Z #define __GNUG__ 11 2025-05-07T19:46:31.4372387Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:31.4372664Z #define _T_WCHAR 2025-05-07T19:46:31.4372942Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:31.4373253Z #define __specialization_static 2025-05-07T19:46:31.4373594Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:46:31.4373918Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:31.4374210Z #define cudaArraySparse 0x40 2025-05-07T19:46:31.4374516Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:31.4374777Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:31.4375098Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:31.4375414Z #define _WCHAR_T 2025-05-07T19:46:31.4375671Z #define __cudaCDP2Free 2025-05-07T19:46:31.4376352Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:31.4377133Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:31.4377568Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:31.4378132Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:46:31.4378459Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:31.4378741Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:31.4379117Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:31.4379490Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:31.4379790Z #define __NO_CTYPE 1 2025-05-07T19:46:31.4380042Z #define __stub_bdflush 2025-05-07T19:46:31.4380443Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:31.4380893Z #define __CORRECT_ISO_CPP_STRING_H_PROTO 2025-05-07T19:46:31.4381242Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:31.4381563Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:46:31.4381853Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:31.4382207Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:31.4382521Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:31.4382904Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:31.4383274Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:31.4383616Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:46:31.4383918Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:31.4384309Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:31.4385013Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:31.4385345Z #define _IO_STDIO 040000 2025-05-07T19:46:31.4385803Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:46:31.4386249Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:31.4386639Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:31.4386975Z #define _PTRDIFF_T 2025-05-07T19:46:31.4387266Z #define _MOVE_H 1 2025-05-07T19:46:31.4387536Z #define __cpp_hex_float 201603L 2025-05-07T19:46:31.4387878Z #define ADJ_TAI 0x0080 2025-05-07T19:46:31.4388152Z #define __ptrvalue 2025-05-07T19:46:31.4388455Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:31.4388772Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:46:31.4389098Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:31.4389548Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:31.4389830Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:31.4390179Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:31.4390623Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:31.4391089Z #define __USE_GNU 1 2025-05-07T19:46:31.4391358Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:46:31.4391701Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:31.4391999Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:31.4392449Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:31.4392909Z #define WEXITED 4 2025-05-07T19:46:31.4393152Z #define _IO_NO_READS 4 2025-05-07T19:46:31.4393618Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:31.4393995Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:31.4394336Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:31.4394670Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:31.4395056Z #define __uid_t_defined 2025-05-07T19:46:31.4395336Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:31.4395684Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:31.4396016Z #define WNOHANG 1 2025-05-07T19:46:31.4396298Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:31.4396672Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:31.4396976Z #define cudaEventDefault 0x00 2025-05-07T19:46:31.4397339Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:31.4397700Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:31.4397996Z #define __x86_64 1 2025-05-07T19:46:31.4398264Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:31.4398738Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:31.4399272Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:31.4399861Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:31.4400469Z #define __PTRDIFF_T 2025-05-07T19:46:31.4400838Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:31.4401410Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:31.4401722Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:31.4402077Z #define _Mlong_double_ long double 2025-05-07T19:46:31.4402397Z #define __cpp_lambdas 200907L 2025-05-07T19:46:31.4402716Z #define _IO_DEC 020 2025-05-07T19:46:31.4402962Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:46:31.4403284Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:31.4403626Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:31.4403935Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:31.4404246Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:31.4404570Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:31.4404947Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:31.4405244Z #define _ANSI_STDDEF_H 2025-05-07T19:46:31.4405565Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:31.4405913Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:31.4406329Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:46:31.4406755Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:31.4407085Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:31.4407408Z #define __cpp_template_auto 201606L 2025-05-07T19:46:31.4407798Z #define __DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:46:31.4408234Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:31.4408525Z #define __key_t_defined 2025-05-07T19:46:31.4408821Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:31.4409219Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:31.4409757Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:46:31.4410161Z #define __GNUC_VA_LIST 2025-05-07T19:46:31.4410553Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:31.4410998Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:31.4411288Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:31.4411622Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:31.4411945Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:46:31.4412248Z #define __WCOREFLAG 0x80 2025-05-07T19:46:31.4412525Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:31.4412885Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:31.4413191Z #define __LP64__ 1 2025-05-07T19:46:31.4413489Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:31.4413865Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:31.4414175Z #define _IO_off64_t __off64_t 2025-05-07T19:46:31.4414488Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4414777Z #define __time_t_defined 1 2025-05-07T19:46:31.4415197Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:31.4415587Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:46:31.4416030Z #define __USE_UNIX98 1 2025-05-07T19:46:31.4416299Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:31.4416641Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:31.4416943Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:31.4417303Z #define __LEAF_ATTR __attribute__ ((__leaf__)) 2025-05-07T19:46:31.4417686Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:46:31.4417975Z #define SEEK_CUR 1 2025-05-07T19:46:31.4418269Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:31.4418668Z #define _ASSERT_H 1 2025-05-07T19:46:31.4419312Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:31.4419988Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:31.4420319Z #define CHAR_MAX SCHAR_MAX 2025-05-07T19:46:31.4420599Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:31.4420925Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:31.4421254Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:31.4421655Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:31.4422188Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:31.4422897Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:31.4423639Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:31.4423964Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:31.4424376Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:31.4424812Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:31.4425112Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:46:31.4425453Z #define cudaArrayDefault 0x00 2025-05-07T19:46:31.4425754Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:31.4426095Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:31.4426396Z #define TLOSS 5 2025-05-07T19:46:31.4426664Z #define __ssize_t_defined 2025-05-07T19:46:31.4426933Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:31.4427245Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:31.4427555Z #define ULONG_MAX (LONG_MAX * 2UL + 1UL) 2025-05-07T19:46:31.4427899Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:46:31.4428312Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:31.4428721Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:31.4429042Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:46:31.4429411Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:31.4429943Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:31.4430268Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:31.4430615Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:31.4430926Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:46:31.4431330Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:31.4431740Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:31.4432040Z #define __cdecl 2025-05-07T19:46:31.4432345Z #define cudaEventInterprocess 0x04 2025-05-07T19:46:31.4432708Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:31.4433114Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:31.4433406Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:31.4433745Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:31.4434078Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:31.4434412Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:31.4434758Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:31.4435157Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:31.4435645Z #define __NV_GLIBCXX_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:31.4436143Z #define ADJ_NANO 0x2000 2025-05-07T19:46:31.4436519Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:46:31.4437001Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:31.4437372Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:31.4437681Z #define __FLT_DIG__ 6 2025-05-07T19:46:31.4438116Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:31.4438583Z #define __NO_INLINE__ 1 2025-05-07T19:46:31.4438967Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:31.4439409Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:31.4439715Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:31.4440039Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:31.4440369Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:31.4440714Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:31.4441048Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:31.4441405Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:46:31.4441826Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:31.4442401Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:31.4442768Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:46:31.4443154Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:31.4443434Z #define MAX_CANON 255 2025-05-07T19:46:31.4443678Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:31.4443973Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:31.4444313Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:31.4444636Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:31.4444962Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:31.4445314Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:31.4445614Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:31.4445986Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:31.4446349Z #define __VERSION__ "11.4.0" 2025-05-07T19:46:31.4446626Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:31.4446961Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:31.4447265Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:31.4447587Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:31.4447918Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:31.4448258Z #define __UINT64_C(c) c ## UL 2025-05-07T19:46:31.4448531Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:31.4448827Z #define _SYS_TYPES_H 1 2025-05-07T19:46:31.4449082Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:31.4449386Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:31.4449679Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:31.4449932Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:31.4450252Z #define __cpp_unicode_characters 201411L 2025-05-07T19:46:31.4450564Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:31.4450862Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:31.4451179Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:31.4451498Z #define FP_SUBNORMAL 3 2025-05-07T19:46:31.4451773Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:31.4452103Z #define _INITIALIZER_LIST 2025-05-07T19:46:31.4452372Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:31.4452668Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:31.4452996Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:31.4453300Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:31.4453601Z #define _IO_file_flags _flags 2025-05-07T19:46:31.4453875Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:31.4454168Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:31.4454469Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:31.4454789Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:31.4455073Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:31.4455501Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:31.4455923Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:31.4456290Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:31.4456621Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:31.4456899Z #define _BSD_SOURCE 1 2025-05-07T19:46:31.4457192Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:31.4458169Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_ ##_NTYPE : false_type { }; template struct __has_ ##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:46:31.4459133Z #define __catch(X) catch(X) 2025-05-07T19:46:31.4459420Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:46:31.4459772Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:31.4460114Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:31.4460392Z #define __STRING(x) #x 2025-05-07T19:46:31.4460686Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:31.4460981Z #define _T_PTRDIFF_ 2025-05-07T19:46:31.4461283Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:31.4461608Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:31.4461938Z #define __unbounded 2025-05-07T19:46:31.4462192Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:31.4462531Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:46:31.4462825Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:31.4463169Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:31.4463498Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:31.4463810Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:31.4464177Z #define LONG_LONG_MIN (-LONG_LONG_MAX - 1LL) 2025-05-07T19:46:31.4464499Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:31.4464817Z #define __managed__ __location__(managed) 2025-05-07T19:46:31.4465190Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:31.4465628Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:31.4466074Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:31.4466377Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:31.4466799Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:31.4467224Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:46:31.4467523Z #define _SYS_SIZE_T_H 2025-05-07T19:46:31.4467828Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:31.4468215Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:31.4468510Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:31.4468864Z #define _CRTIMP 2025-05-07T19:46:31.4469112Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:31.4469541Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:31.4470070Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:31.4493149Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:31.4493738Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4494123Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:31.4494467Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:31.4494803Z #define __SIZE_T__ 2025-05-07T19:46:31.4495075Z #define __stub_gtty 2025-05-07T19:46:31.4495337Z #define __pid_t_defined 2025-05-07T19:46:31.4495646Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:31.4495973Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:31.4496351Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:31.4496705Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:31.4496984Z #define __need_clockid_t 2025-05-07T19:46:31.4497293Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:31.4497593Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:31.4497973Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:31.4498333Z #define _IO_HEX 0100 2025-05-07T19:46:31.4498641Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:31.4499023Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:31.4499386Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:31.4499693Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:31.4500276Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:31.4500784Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:31.4501129Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:31.4501478Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:31.4501807Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:31.4501935Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:31.4502025Z #define __stub_sstk 2025-05-07T19:46:31.4502298Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:31.4502472Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:31.4502578Z #define __wur 2025-05-07T19:46:31.4502709Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:31.4502817Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:31.4502937Z #define _IO_OCT 040 2025-05-07T19:46:31.4503043Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:46:31.4503144Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:31.4503250Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:31.4503401Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:31.4503505Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:31.4503622Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:31.4503860Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:31.4503966Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:46:31.4504067Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:31.4504185Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:31.4504303Z #define __off64_t_defined 2025-05-07T19:46:31.4504418Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:31.4504517Z #define __FLT128_DIG__ 33 2025-05-07T19:46:31.4504637Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:31.4504741Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:31.4504976Z #define __INT32_C(c) c 2025-05-07T19:46:31.4505085Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:46:31.4505194Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:31.4505328Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:46:31.4505428Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:31.4505534Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:31.4505660Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:31.4505803Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:31.4505911Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:31.4506015Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:31.4506142Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:46:31.4506247Z #define __have_pthread_attr_t 1 2025-05-07T19:46:31.4506363Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:31.4506624Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:31.4506749Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:31.4506862Z #define __cudaCDP2EventRecord 2025-05-07T19:46:31.4506971Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:31.4507073Z #define htole32(x) (x) 2025-05-07T19:46:31.4507338Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:31.4507477Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:31.4507617Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:31.4507786Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:31.4507940Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:31.4508077Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:46:31.4508248Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:31.4508347Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:31.4508461Z #define cudaArrayLayered 0x01 2025-05-07T19:46:31.4508665Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:31.4508782Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:31.4508885Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:31.4509019Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:46:31.4509110Z #define unix 1 2025-05-07T19:46:31.4509211Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:31.4509401Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:31.4509537Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:31.4509675Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:31.4509936Z #define __USE_POSIX 1 2025-05-07T19:46:31.4510075Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:31.4510218Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:31.4510322Z #define __THROWNL throw () 2025-05-07T19:46:31.4510432Z #define __cpp_rtti 199711L 2025-05-07T19:46:31.4510582Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:31.4510706Z #define __PMT(args) args 2025-05-07T19:46:31.4510897Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4511092Z #define __va_arg_pack_len() __builtin_va_arg_pack_len () 2025-05-07T19:46:31.4511220Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:31.4511327Z #define _SIZE_T_DECLARED 2025-05-07T19:46:31.4511440Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:31.4511579Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:46:31.4512029Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:31.4512143Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:31.4512282Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:31.4512390Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:31.4512551Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:46:31.4512645Z #define _WCHAR_T_H 2025-05-07T19:46:31.4512776Z #define __FLT64X_DIG__ 18 2025-05-07T19:46:31.4512870Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:31.4512970Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:31.4513105Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:31.4513216Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:31.4513317Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:31.4513435Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:31.4514336Z #define __ELF__ 1 2025-05-07T19:46:31.4514449Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:31.4514561Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:31.4514695Z #define STA_INS 0x0010 2025-05-07T19:46:31.4514810Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:31.4515003Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:31.4515101Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:31.4515239Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:31.4515363Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4515483Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4515614Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:31.4515744Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:31.4515849Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:31.4516044Z #define __warnattr(msg) __attribute__((__warning__ (msg))) 2025-05-07T19:46:31.4516215Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:31.4516328Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:31.4516677Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:31.4516835Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:31.4516940Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:31.4517036Z #define __FLT_RADIX__ 2 2025-05-07T19:46:31.4517170Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:46:31.4517355Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:46:31.4517469Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:46:31.4517565Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:31.4517699Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:31.4517813Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:31.4517918Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:31.4518063Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:31.4518162Z #define WORD_BIT 32 2025-05-07T19:46:31.4518261Z #define _IO_USER_BUF 1 2025-05-07T19:46:31.4518366Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:31.4518506Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4518628Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:31.4518740Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:31.4518870Z #define __long_double_t long double 2025-05-07T19:46:31.4518976Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:31.4519080Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:31.4519519Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:31.4519641Z #define __k8 1 2025-05-07T19:46:31.4519860Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:31.4520106Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:46:31.4520253Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:31.4520366Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:46:31.4520478Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:31.4520612Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:31.4520724Z #define __blksize_t_defined 2025-05-07T19:46:31.4520827Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:31.4520940Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:31.4521090Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:31.4521200Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:31.4521324Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:31.4521460Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:31.4521565Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:31.4521961Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:31.4522427Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:31.4522550Z #define UCHAR_MAX (SCHAR_MAX * 2 + 1) 2025-05-07T19:46:31.4522653Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:31.4522748Z #define SEEK_SET 0 2025-05-07T19:46:31.4522874Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:31.4523023Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:31.4523218Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:31.4523360Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:31.4523467Z #define __cudaCDP2GetLastError 2025-05-07T19:46:31.4523569Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:31.4523667Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:31.4524032Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:31.4524140Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:31.4524247Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:31.4524371Z #define __stub_sigreturn 2025-05-07T19:46:31.4524622Z #define __errordecl(name,msg) extern void name (void) __attribute__((__error__ (msg))) 2025-05-07T19:46:31.4524724Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:31.4524822Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:31.4524961Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:31.4525053Z #define CLOCK_TAI 11 2025-05-07T19:46:31.4525165Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:31.4525285Z #define __restrict_arr 2025-05-07T19:46:31.4525400Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:31.4525543Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:31.4526128Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:31.4526322Z #define __attribute_artificial__ __attribute__ ((__artificial__)) 2025-05-07T19:46:31.4526408Z #define __USE_MISC 1 2025-05-07T19:46:31.4526519Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:31.4526652Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:31.4526744Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:31.4526828Z #define __LDBL_DIG__ 18 2025-05-07T19:46:31.4526962Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:31.4527072Z #define __malloc_and_calloc_defined 2025-05-07T19:46:31.4527166Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:46:31.4527275Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:31.4527389Z #define __x86_64__ 1 2025-05-07T19:46:31.4527478Z #define _SIZE_T_ 2025-05-07T19:46:31.4528432Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:31.4528555Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:31.4528709Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:46:31.4528834Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:31.4528976Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:46:31.4529075Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:31.4529192Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:31.4529334Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:31.4529478Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:31.4529580Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:31.4530073Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:31.4530214Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:31.4530362Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:31.4530479Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:31.4530592Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:31.4530687Z #define STA_FLL 0x0008 2025-05-07T19:46:31.4530834Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:31.4530933Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:31.4531126Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4531238Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:31.4531330Z #define __stub_revoke 2025-05-07T19:46:31.4531444Z #define __timer_t_defined 1 2025-05-07T19:46:31.4531578Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:31.4531678Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:31.4531802Z #define ULLONG_MAX (LLONG_MAX * 2ULL + 1) 2025-05-07T19:46:31.4531909Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:31.4532008Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:31.4532117Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:31.4532246Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:31.4532354Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:31.4532507Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:31.4532627Z #define __NV_LEGACY_LAUNCH 1 2025-05-07T19:46:31.4532717Z #define _IO_off_t __off_t 2025-05-07T19:46:31.4532808Z #define __FLT64_DIG__ 15 2025-05-07T19:46:31.4533041Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:31.4533157Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:31.4533292Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4533418Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:31.4533555Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:31.4533663Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:31.4533749Z #define NULL __null 2025-05-07T19:46:31.4533883Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:31.4534017Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:31.4534115Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:31.4534224Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4534352Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:31.4534438Z #define FP_ZERO 2 2025-05-07T19:46:31.4534543Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:31.4534699Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:31.4534842Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4534928Z #define __WCHAR_T__ 2025-05-07T19:46:31.4535027Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:46:31.4535255Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:46:31.4535403Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:31.4535501Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:31.4535653Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:46:31.4535779Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:31.4535907Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:31.4536038Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:31.4536168Z #define _BSD_PTRDIFF_T_ 2025-05-07T19:46:31.4536313Z #define _SIGSET_H_types 1 2025-05-07T19:46:31.4536429Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:31.4536567Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:31.4536721Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:31.4536824Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:31.4536946Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:46:31.4537108Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:31.4537220Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:46:31.4537348Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:31.4537544Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:31.4537650Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:46:31.4537757Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:31.4537860Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:31.4537980Z #define STA_MODE 0x4000 2025-05-07T19:46:31.4538099Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:46:31.4538204Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:31.4538347Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:31.4538456Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:31.4538625Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:31.4538749Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:31.4538851Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:31.4538972Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:31.4539070Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:31.4539215Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4539305Z #define __SEG_FS 1 2025-05-07T19:46:31.4539404Z #define _IO_size_t size_t 2025-05-07T19:46:31.4539531Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:46:31.4539638Z #define INT_MIN (-INT_MAX - 1) 2025-05-07T19:46:31.4539733Z #define __stub_lchmod 2025-05-07T19:46:31.4539835Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:46:31.4539985Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4540092Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:31.4540181Z #define __SEG_GS 1 2025-05-07T19:46:31.4540388Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:46:31.4540482Z #define _IOS_APPEND 8 2025-05-07T19:46:31.4540583Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:31.4540678Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:31.4540808Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:31.4540915Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:31.4541021Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:31.4541133Z #define htole16(x) (x) 2025-05-07T19:46:31.4541247Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:31.4541346Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:31.4541446Z #define __INT16_TYPE__ short int 2025-05-07T19:46:31.4541572Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:31.4541680Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:31.4541800Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:31.4541942Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:31.4542034Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:31.4542129Z #define __WCLONE 0x80000000 2025-05-07T19:46:31.4542224Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:46:31.4542332Z #define SEEK_HOLE 4 2025-05-07T19:46:31.4542422Z #define TIMER_ABSTIME 1 2025-05-07T19:46:31.4542521Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:46:31.4542637Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:31.4542816Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:31.4542933Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4543039Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:31.4543173Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:46:31.4543271Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4543398Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:31.4543511Z #define _LINUX_LIMITS_H 2025-05-07T19:46:31.4543594Z #define linux 1 2025-05-07T19:46:31.4543737Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:31.4543856Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:31.4543979Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:31.4544074Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:31.4544186Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:31.4544356Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:31.4544457Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:31.4544555Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4544687Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:31.4544781Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:31.4544869Z #define htole64(x) (x) 2025-05-07T19:46:31.4544973Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:31.4545135Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:31.4545235Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:31.4545732Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:31.4545852Z #define __USE_POSIX2 1 2025-05-07T19:46:31.4545951Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:31.4546037Z #define __WALL 0x40000000 2025-05-07T19:46:31.4546137Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:31.4546318Z #define _XLOCALE_H 1 2025-05-07T19:46:31.4546417Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:31.4546515Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:46:31.4546614Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:31.4546746Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:31.4546840Z #define __EXCEPTIONS 1 2025-05-07T19:46:31.4546946Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:31.4547165Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:31.4547256Z #define __WORDSIZE 64 2025-05-07T19:46:31.4547351Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:31.4547436Z #define _STL_RELOPS_H 1 2025-05-07T19:46:31.4547559Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:31.4547662Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:31.4547759Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:31.4547880Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:31.4547973Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:31.4548280Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:46:31.4548527Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:31.4548674Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:31.4548781Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:31.4548887Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:31.4549030Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:31.4549136Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:31.4549243Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:31.4549517Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:31.4549625Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:46:31.4549903Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:31.4550015Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:31.4550235Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:31.4550366Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:46:31.4550467Z #define _STRING_H 1 2025-05-07T19:46:31.4550606Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:31.4550713Z #define _GCC_MAX_ALIGN_T 2025-05-07T19:46:31.4550829Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:31.4550973Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:46:31.4551096Z #define __code_model_small__ 1 2025-05-07T19:46:31.4551199Z #define _PSTL_CONFIG_H 2025-05-07T19:46:31.4551320Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:31.4551470Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:31.4551577Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:31.4551691Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:31.4552126Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:31.4552256Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:46:31.4552356Z #define le64toh(x) (x) 2025-05-07T19:46:31.4552460Z #define FILENAME_MAX 4096 2025-05-07T19:46:31.4552641Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:31.4552763Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:31.4552859Z #define L_cuserid 9 2025-05-07T19:46:31.4552979Z #define __ino_t_defined 2025-05-07T19:46:31.4553071Z #define __k8__ 1 2025-05-07T19:46:31.4553183Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:31.4553307Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:46:31.4553427Z #define __int8_t_defined 2025-05-07T19:46:31.4553528Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:31.4553644Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:46:31.4553787Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:31.4553900Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:31.4554002Z #define _IOS_TRUNC 16 2025-05-07T19:46:31.4554134Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:31.4554325Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:31.4554426Z #define __HAVE_COLUMN 2025-05-07T19:46:31.4554586Z #define __stub_fdetach 2025-05-07T19:46:31.4555076Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:31.4555166Z #define __pic__ 2 2025-05-07T19:46:31.4555295Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4555402Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:31.4555523Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:31.4555632Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:31.4555730Z #define __stub_chflags 2025-05-07T19:46:31.4555855Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:31.4555952Z #define __need_IOV_MAX 2025-05-07T19:46:31.4556075Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:31.4556189Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:31.4556323Z #define __cpp_decltype 200707L 2025-05-07T19:46:31.4556439Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:31.4556543Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:31.4556695Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:31.4556789Z #define TTY_NAME_MAX 32 2025-05-07T19:46:31.4556968Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:31.4557098Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4557308Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:31.4557426Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:31.4557524Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:31.4557654Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:31.4557751Z #define __import__ 2025-05-07T19:46:31.4557846Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:31.4558014Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:31.4558113Z #define __export__ 2025-05-07T19:46:31.4558242Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:31.4558350Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:31.4558552Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:31.4558658Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:31.4558757Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:31.4558890Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:46:31.4558993Z #define _WCHAR_T_DECLARED 2025-05-07T19:46:31.4559119Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:31.4559245Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:31.4559383Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:31.4559488Z #define WNOWAIT 0x01000000 2025-05-07T19:46:31.4559574Z #define PLOSS 6 2025-05-07T19:46:31.4559701Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:31.4559996Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:31.4560144Z #define EXIT_SUCCESS 0 2025-05-07T19:46:31.4560245Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:31.4560374Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:31.4560489Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:31.4560585Z #define __thread__ __thread 2025-05-07T19:46:31.4560718Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:31.4560825Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:46:31.4560943Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:31.4561195Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:31.4561339Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:31.4561449Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:31.4561542Z #define __linux__ 1 2025-05-07T19:46:31.4561673Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:31.4561815Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:31.4561927Z #define __S16_TYPE short int 2025-05-07T19:46:31.4562421Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:31.4562534Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:31.4562730Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:31.4562889Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:31.4563016Z #define UINT_MAX (INT_MAX * 2U + 1U) 2025-05-07T19:46:31.4563107Z #define _T_SIZE_ 2025-05-07T19:46:31.4563212Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:31.4563366Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:31.4563472Z #define _PSTL_VERSION 12000 2025-05-07T19:46:31.4563601Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:31.4563711Z #define __WNOTHREAD 0x20000000 2025-05-07T19:46:31.4563826Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:31.4563960Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:46:31.4564055Z #define _IOS_INPUT 1 2025-05-07T19:46:31.4564179Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:31.4564292Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:31.4564392Z #define __INT64_TYPE__ long int 2025-05-07T19:46:31.4564497Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:31.4564626Z #define __shared__ __location__(shared) 2025-05-07T19:46:31.4564717Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:31.4564877Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:31.4564992Z #define __gid_t_defined 2025-05-07T19:46:31.4565106Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:31.4565207Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:31.4565414Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:31.4565546Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:31.4565638Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:31.4565735Z #define ___int_size_t_h 2025-05-07T19:46:31.4565877Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:31.4565995Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:31.4566156Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:31.4566295Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:31.4566393Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:31.4566495Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:31.4566601Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:31.4566758Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4566872Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:31.4566996Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:31.4567128Z #define __clock_t_defined 1 2025-05-07T19:46:31.4567227Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:31.4567343Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:31.4567439Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:31.4567566Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:46:31.4567662Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:31.4567780Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:31.4567906Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:31.4568151Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:31.4568235Z #define __SSE__ 1 2025-05-07T19:46:31.4568331Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:31.4568466Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:31.4568556Z #define _CTYPE_H 1 2025-05-07T19:46:31.4568644Z #define __sigset_t_defined 2025-05-07T19:46:31.4568774Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:31.4568875Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:31.4568970Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:31.4569067Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:31.4569191Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:31.4569282Z #define __SM_70_RT_H__ 2025-05-07T19:46:31.4569381Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:31.4569519Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:31.4569618Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:31.4569776Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:31.4569882Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:31.4570029Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:31.4570130Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:31.4570221Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:31.4570383Z #define __amd64__ 1 2025-05-07T19:46:31.4570477Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:31.4570583Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:31.4570851Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:31.4570977Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:31.4571065Z #define EOF (-1) 2025-05-07T19:46:31.4571159Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:31.4571272Z #define __USE_POSIX199309 1 2025-05-07T19:46:31.4571372Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:31.4571470Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:31.4571586Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:46:31.4571693Z #define LLONG_MIN (-LLONG_MAX-1) 2025-05-07T19:46:31.4571818Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:31.4571916Z #define ____mbstate_t_defined 1 2025-05-07T19:46:31.4572034Z #define STA_NANO 0x2000 2025-05-07T19:46:31.4572130Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:31.4572231Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:31.4572349Z #define _IO_LINKED 0x80 2025-05-07T19:46:31.4572452Z #define __cpp_lib_launder 201606 2025-05-07T19:46:31.4572550Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:31.4572657Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:31.4572769Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:46:31.4572860Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:31.4573004Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:31.4573144Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4573249Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:31.4573350Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:31.4573451Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:31.4573561Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:31.4573695Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:31.4573821Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:31.4574049Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:31.4574245Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:46:31.4574335Z #define __stub_stty 2025-05-07T19:46:31.4574509Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:31.4574618Z #define le16toh(x) (x) 2025-05-07T19:46:31.4574729Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:31.4574909Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:46:31.4575016Z #define _SIZET_ 2025-05-07T19:46:31.4575108Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:31.4575200Z #define _SVID_SOURCE 1 2025-05-07T19:46:31.4575293Z #define _LP64 1 2025-05-07T19:46:31.4575411Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:31.4575701Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:31.4575825Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:31.4575950Z #define __UINT8_C(c) c 2025-05-07T19:46:31.4576050Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:31.4576152Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:31.4576289Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:31.4576381Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:31.4576476Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:46:31.4576579Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:46:31.4576693Z #define CUDARTAPI 2025-05-07T19:46:31.4576776Z #define IOV_MAX 1024 2025-05-07T19:46:31.4576922Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:31.4577051Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:31.4577159Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:31.4577248Z #define __wchar_t__ 2025-05-07T19:46:31.4577361Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:31.4577487Z #define SEEK_END 2 2025-05-07T19:46:31.4577591Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:31.4577773Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:31.4577899Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:31.4578052Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:31.4578229Z #define ____FILE_defined 1 2025-05-07T19:46:31.4578345Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:31.4578473Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:46:31.4578567Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:31.4578666Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:31.4578959Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:31.4579098Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:31.4579192Z #define _IO_RIGHT 04 2025-05-07T19:46:31.4579289Z #define __END_NAMESPACE_STD 2025-05-07T19:46:31.4579515Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:31.4579622Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:31.4579742Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:31.4579872Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:31.4579985Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:31.4580083Z #define _STDDEF_H_ 2025-05-07T19:46:31.4580261Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:31.4580381Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4580501Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:31.4580694Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:31.4580828Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:31.4580968Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:31.4581094Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:31.4581221Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:31.4581334Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:31.4581440Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:31.4581555Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:31.4581667Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:31.4581767Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:31.4581870Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:31.4582067Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:46:31.4582161Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:31.4582344Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:31.4582445Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:31.4582557Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:31.4582698Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:31.4582797Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:46:31.4582905Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:31.4583007Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:31.4583100Z #define P_tmpdir "/tmp" 2025-05-07T19:46:31.4583278Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:31.4583389Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:46:31.4583498Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:31.4583666Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:31.4583856Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:46:31.4583965Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:31.4584090Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:31.4584220Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:31.4584320Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:31.4585309Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:31.4585422Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:31.4585567Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:31.4585665Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:31.4585772Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:31.4585897Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:31.4586004Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:31.4586109Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:31.4586201Z #define __FXSR__ 1 2025-05-07T19:46:31.4586324Z #define _SIZE_T 2025-05-07T19:46:31.4586551Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:31.4586678Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:31.4586894Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:31.4587050Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:31.4587154Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:31.4587265Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:31.4587486Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:31.4587697Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:31.4587798Z #define _GXX_NULLPTR_T 2025-05-07T19:46:31.4587965Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:31.4588062Z #define FOPEN_MAX 16 2025-05-07T19:46:31.4588164Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:31.4588315Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:31.4588423Z #define __suseconds_t_defined 2025-05-07T19:46:31.4588519Z #define __off_t_defined 2025-05-07T19:46:31.4588619Z #define stderr stderr 2025-05-07T19:46:31.4588753Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:31.4588871Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:31.4588999Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:31.4589095Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:31.4589630Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:31.4589758Z #define __mode_t_defined 2025-05-07T19:46:31.4589850Z #define _GCC_SIZE_T 2025-05-07T19:46:31.4589959Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:31.4590076Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:46:31.4590224Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:31.4590323Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:31.4590428Z #define __UINT32_C(c) c ## U 2025-05-07T19:46:31.4590569Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:31.4590684Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:31.4590794Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:31.4590893Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:31.4591021Z #define __size_t__ 2025-05-07T19:46:31.4591157Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:31.4591253Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:31.4591392Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:31.4591548Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:31.4591646Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:31.4591826Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:46:31.4591946Z #define __SM_80_RT_H__ 2025-05-07T19:46:31.4592034Z #define _ENDIAN_H 1 2025-05-07T19:46:31.4592226Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:31.4592353Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:31.4592462Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:31.4592548Z #define __try try 2025-05-07T19:46:31.4592652Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:31.4592782Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:46:31.4592876Z #define __INT8_MAX__ 0x7f 2025-05-07T19:46:31.4593157Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:31.4593278Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:31.4593368Z #define __PIC__ 2 2025-05-07T19:46:31.4593485Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:31.4593644Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:46:31.4593788Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:31.4593893Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:31.4593995Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:31.4594227Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:31.4594336Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:31.4594443Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:31.4594563Z #define _IO_uid_t __uid_t 2025-05-07T19:46:31.4594674Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:31.4594861Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:31.4594960Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:31.4595139Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:31.4595247Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:31.4595374Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:31.4595482Z #define LONG_BIT 64 2025-05-07T19:46:31.4595605Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:31.4595714Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:31.4595845Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:31.4595965Z #define __fsfilcnt_t_defined 2025-05-07T19:46:31.4596067Z #define __blkcnt_t_defined 2025-05-07T19:46:31.4596356Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:31.4596469Z #define __USE_LARGEFILE 1 2025-05-07T19:46:31.4596571Z #define __cpp_constexpr 201603L 2025-05-07T19:46:31.4596675Z #define CUDART_VERSION 12060 2025-05-07T19:46:31.4596770Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:31.4596894Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:31.4596993Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:31.4597206Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:31.4597308Z #define __lldiv_t_defined 1 2025-05-07T19:46:31.4597396Z #define __SSE2__ 1 2025-05-07T19:46:31.4597489Z #define _IOLBF 1 2025-05-07T19:46:31.4597593Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:31.4597714Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:31.4597830Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:31.4597928Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:31.4598054Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:31.4598154Z #define __INT32_TYPE__ int 2025-05-07T19:46:31.4598255Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:31.4598388Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:31.4598499Z #define __cpp_exceptions 199711L 2025-05-07T19:46:31.4598605Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:31.4598724Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:31.4598842Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:31.4598971Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:31.4599146Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:46:31.4599268Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:31.4599370Z #define __SWORD_TYPE long int 2025-05-07T19:46:31.4599475Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:31.4599583Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:31.4599697Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:31.4599793Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:31.4600163Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:31.4600283Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:46:31.4600433Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:31.4600520Z #define _T_SIZE 2025-05-07T19:46:31.4600635Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:31.4600788Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:31.4600925Z #define __va_arg_pack() __builtin_va_arg_pack () 2025-05-07T19:46:31.4601027Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:31.4601146Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:31.4601385Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:31.4601480Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:31.4601584Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4601704Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:31.4601884Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_ ##_FEAT 2025-05-07T19:46:31.4601984Z #define __GNUC_MINOR__ 4 2025-05-07T19:46:31.4602110Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:31.4602204Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:46:31.4602327Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4602409Z #define __PIE__ 2 2025-05-07T19:46:31.4602532Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:31.4602682Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:31.4602891Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:46:31.4603155Z #define __intN_t(N,MODE) typedef int int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:31.4603254Z #define __nlink_t_defined 2025-05-07T19:46:31.4603390Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:31.4603543Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:31.4603635Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:31.4603910Z #define __u_intN_t(N,MODE) typedef unsigned int u_int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:31.4604041Z #define __cpp_template_template_args 201611L 2025-05-07T19:46:31.4604185Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:31.4604293Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:31.4604387Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:31.4604502Z #define __FILE_defined 1 2025-05-07T19:46:31.4604687Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:46:31.4604784Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:31.4604883Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:31.4605016Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:31.4605134Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:31.4605243Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:31.4605368Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:31.4605459Z #define __INT16_C(c) c 2025-05-07T19:46:31.4605552Z #define __U32_TYPE unsigned int 2025-05-07T19:46:31.4605674Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:31.4605805Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:31.4605900Z #define __STDC__ 1 2025-05-07T19:46:31.4605999Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:31.4606123Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:31.4606227Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:31.4606386Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:31.4606509Z #define __FLT32X_DIG__ 15 2025-05-07T19:46:31.4606610Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:31.4606712Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:31.4606825Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:31.4606955Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:31.4607057Z #define USHRT_MAX (SHRT_MAX * 2 + 1) 2025-05-07T19:46:31.4607161Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:31.4607270Z #define stdin stdin 2025-05-07T19:46:31.4607360Z #define __ino64_t_defined 2025-05-07T19:46:31.4607452Z #define STA_CLK 0x8000 2025-05-07T19:46:31.4607544Z #define __clockid_t_defined 1 2025-05-07T19:46:31.4607809Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:31.4607990Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:31.4608099Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:31.4608225Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:31.4608338Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:31.4608452Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:31.4608657Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:31.4608774Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:31.4609365Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:31.4609455Z #define DOMAIN 1 2025-05-07T19:46:31.4609572Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:31.4609660Z #define __NVCC__ 1 2025-05-07T19:46:31.4609774Z #define __cudaCDP2Memset2DAsync 2025-05-07T19:46:31.4609908Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:31.4610006Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:31.4610118Z #define __throw_exception_again throw 2025-05-07T19:46:31.4610215Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:31.4610368Z #define __EXCEPTION_H 1 2025-05-07T19:46:31.4610468Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:46:31.4610577Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:31.4610902Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:31.4611021Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:31.4611126Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:31.4611259Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:31.4611365Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:31.4611460Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:31.4611615Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:46:31.4611752Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:31.4611864Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:31.4611962Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:46:31.4612101Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:31.4612202Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:31.4612309Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:31.4612452Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:31.4612573Z #define __useconds_t_defined 2025-05-07T19:46:31.4612677Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:31.4612867Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:31.4613046Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:31.4613135Z #define __SSE_MATH__ 1 2025-05-07T19:46:31.4613226Z #define _IO_wint_t wint_t 2025-05-07T19:46:31.4613326Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:31.4613551Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:31.4613642Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:31.4613751Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:31.4613865Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:31.4613964Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:31.4614044Z #define __USE_ATFILE 1 2025-05-07T19:46:31.4614142Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:31.4614270Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:46:31.4614360Z #define _GCC_PTRDIFF_T 2025-05-07T19:46:31.4614578Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:31.4614691Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:46:31.4614798Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:31.4614903Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:31.4615008Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:31.4615110Z #define _STDLIB_H 1 2025-05-07T19:46:31.4615257Z #define __exctype(name) extern int name (int) __THROW 2025-05-07T19:46:31.4615353Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:46:31.4616081Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:31.4616221Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:31.4616328Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:31.4616450Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:31.4616649Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:31.4616810Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:31.4616920Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:31.4617057Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:31.4617155Z #define __ldiv_t_defined 1 2025-05-07T19:46:31.4617341Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:31.4617448Z #define ___int_ptrdiff_t_h 2025-05-07T19:46:31.4617621Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:31.4617731Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:31.4617823Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:31.4617948Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:31.4618051Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:31.4618155Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:31.4618258Z #define CUDART_CB 2025-05-07T19:46:31.4618363Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:31.4618547Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:31.4618628Z #define MB_LEN_MAX 16 2025-05-07T19:46:31.4618869Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:31.4618976Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:31.4619100Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:31.4619234Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:31.4619338Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:31.4619483Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:31.4619613Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:31.4619709Z #define _GNU_SOURCE 1 2025-05-07T19:46:31.4619803Z #define __stub_putmsg 2025-05-07T19:46:31.4619889Z #define __CUDACC__ 1 2025-05-07T19:46:31.4619998Z #define __N(msgid) (msgid) 2025-05-07T19:46:31.4620090Z #define __P(args) args 2025-05-07T19:46:31.4620342Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:31.4620463Z #define __cpp_init_captures 201304L 2025-05-07T19:46:31.4620569Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:31.4620662Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:31.4620769Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:31.4620866Z #define __WCHAR_T 2025-05-07T19:46:31.4620955Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:31.4621049Z #define __fsblkcnt_t_defined 2025-05-07T19:46:31.4621187Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:31.4621295Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:31.4621304Z 2025-05-07T19:46:31.4877003Z 2025-05-07T19:46:31.4877569Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:31.4877581Z 2025-05-07T19:46:33.3250352Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:33.3251431Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:33.3251947Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:33.3252287Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:33.3252700Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:33.3252923Z 2025-05-07T19:46:33.3842126Z 2025-05-07T19:46:33.3856906Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:33.3857768Z [CHECK] nvidia-smi not found 2025-05-07T19:46:33.3858101Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:33.3953387Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:33.3954061Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:33.3954783Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:33.3955140Z env: 2025-05-07T19:46:33.3955420Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:33.3955761Z BUILD_ENV: build_binary 2025-05-07T19:46:33.3956066Z BUILD_TARGET: default 2025-05-07T19:46:33.3956335Z BUILD_VARIANT: cuda 2025-05-07T19:46:33.3956634Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:33.3956936Z ##[endgroup] 2025-05-07T19:46:33.8283896Z ################################################################################ 2025-05-07T19:46:33.8284320Z # Install PyTorch (PIP) 2025-05-07T19:46:33.8284756Z # 2025-05-07T19:46:33.8296181Z # [2025-05-07T19:46:33.829Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:33.8296800Z ################################################################################ 2025-05-07T19:46:33.8297043Z 2025-05-07T19:46:33.8326769Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:34.7527752Z Channels: 2025-05-07T19:46:34.7528449Z - conda-forge 2025-05-07T19:46:34.7529134Z Platform: linux-64 2025-05-07T19:46:37.8005164Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:39.4320930Z Solving environment: \ | / - done 2025-05-07T19:46:39.7305904Z 2025-05-07T19:46:39.7306577Z ## Package Plan ## 2025-05-07T19:46:39.7307538Z 2025-05-07T19:46:39.7308135Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:39.7309075Z 2025-05-07T19:46:39.7309640Z added / updated specs: 2025-05-07T19:46:39.7310378Z - numpy 2025-05-07T19:46:39.7310766Z 2025-05-07T19:46:39.7310778Z 2025-05-07T19:46:39.7311133Z The following packages will be downloaded: 2025-05-07T19:46:39.7311804Z 2025-05-07T19:46:39.7312161Z package | build 2025-05-07T19:46:39.7313036Z ---------------------------|----------------- 2025-05-07T19:46:39.7313484Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:39.7313988Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:39.7314527Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:39.7315044Z numpy-2.2.5 | py311h5d046bc_0 8.6 MB conda-forge 2025-05-07T19:46:39.7315511Z ------------------------------------------------------------ 2025-05-07T19:46:39.7315917Z Total: 8.7 MB 2025-05-07T19:46:39.7316149Z 2025-05-07T19:46:39.7316291Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:39.7316538Z 2025-05-07T19:46:39.7316836Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:39.7317400Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:39.7318005Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:39.7318567Z numpy conda-forge/linux-64::numpy-2.2.5-py311h5d046bc_0 2025-05-07T19:46:39.7318868Z 2025-05-07T19:46:39.7318872Z 2025-05-07T19:46:39.7318876Z 2025-05-07T19:46:39.7319038Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:39.7319469Z numpy-2.2.5 | 8.6 MB | | 0% 2025-05-07T19:46:39.7319722Z 2025-05-07T19:46:39.7320045Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:39.7320327Z 2025-05-07T19:46:39.7320331Z 2025-05-07T19:46:39.7320567Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:39.7320834Z 2025-05-07T19:46:39.7320862Z 2025-05-07T19:46:39.7320865Z 2025-05-07T19:46:39.8845685Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:39.8846003Z 2025-05-07T19:46:39.8846482Z 2025-05-07T19:46:39.8846489Z 2025-05-07T19:46:39.8846808Z liblapack-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:39.8847104Z 2025-05-07T19:46:39.8851564Z libblas-3.9.0 | 16 KB | #########7 | 97%  2025-05-07T19:46:39.8851854Z 2025-05-07T19:46:39.8851857Z 2025-05-07T19:46:39.8852214Z 2025-05-07T19:46:39.8872333Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:39.8872655Z 2025-05-07T19:46:39.9224639Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:39.9224968Z 2025-05-07T19:46:39.9253036Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:39.9253866Z 2025-05-07T19:46:39.9253880Z 2025-05-07T19:46:39.9253891Z 2025-05-07T19:46:39.9353099Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:39.9474934Z numpy-2.2.5 | 8.6 MB | | 0% 2025-05-07T19:46:39.9475759Z 2025-05-07T19:46:39.9475772Z 2025-05-07T19:46:39.9479156Z libcblas-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:39.9479962Z 2025-05-07T19:46:39.9479975Z 2025-05-07T19:46:39.9722105Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:39.9722952Z 2025-05-07T19:46:39.9722966Z 2025-05-07T19:46:39.9919514Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:40.3856766Z numpy-2.2.5 | 8.6 MB | ########## | 100% 2025-05-07T19:46:40.3857922Z numpy-2.2.5 | 8.6 MB | ########## | 100% 2025-05-07T19:46:40.3866248Z numpy-2.2.5 | 8.6 MB | ########## | 100% 2025-05-07T19:46:40.3867852Z 2025-05-07T19:46:40.3868487Z 2025-05-07T19:46:40.3869153Z  2025-05-07T19:46:40.3870119Z 2025-05-07T19:46:40.3870131Z 2025-05-07T19:46:40.3870646Z  2025-05-07T19:46:40.3871287Z 2025-05-07T19:46:40.3871299Z 2025-05-07T19:46:40.3871327Z 2025-05-07T19:46:40.3871882Z  done 2025-05-07T19:46:40.4877733Z Preparing transaction: | done 2025-05-07T19:46:40.6889253Z Verifying transaction: - \ done 2025-05-07T19:46:40.7899130Z Executing transaction: / done 2025-05-07T19:46:40.8971709Z ################################################################################ 2025-05-07T19:46:40.8972308Z # Install Package From PyTorch PIP: torch 2025-05-07T19:46:40.8972651Z # 2025-05-07T19:46:40.8995131Z # [2025-05-07T19:46:40.898Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:46:40.8996280Z ################################################################################ 2025-05-07T19:46:40.8996670Z 2025-05-07T19:46:40.9012173Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:46:40.9967168Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:46:40.9968174Z ################################################################################ 2025-05-07T19:46:40.9968663Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:46:40.9969023Z # 2025-05-07T19:46:40.9983193Z # [2025-05-07T19:46:40.997Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:46:40.9983820Z ################################################################################ 2025-05-07T19:46:40.9984065Z 2025-05-07T19:46:41.0004938Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:46:41.0033283Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:46:41.0051943Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:46:41.0052744Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:46:41.0058479Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:46:41.0067466Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:46:41.0092035Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:11.5572977Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:11.5574704Z 2025-05-07T19:48:11.5575117Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:11.5575579Z Collecting torch 2025-05-07T19:48:11.5576343Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp311-cp311-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:11.5577152Z Collecting filelock (from torch) 2025-05-07T19:48:11.5577774Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:11.5578874Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from torch) (4.13.2) 2025-05-07T19:48:11.5579771Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:11.5580322Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:11.5581382Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 30.4 MB/s eta 0:00:00 2025-05-07T19:48:11.5581777Z Collecting networkx (from torch) 2025-05-07T19:48:11.5582773Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:11.5583517Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 11.3 MB/s eta 0:00:00 2025-05-07T19:48:11.5584322Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from torch) (3.1.6) 2025-05-07T19:48:11.5585270Z Collecting fsspec (from torch) 2025-05-07T19:48:11.5585831Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:11.5586512Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:11.5587321Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:11.5588243Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 55.8 MB/s eta 0:00:00 2025-05-07T19:48:11.5588729Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:11.5589648Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:11.5590587Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 4.7 MB/s eta 0:00:00 2025-05-07T19:48:11.5591037Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:11.5591912Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:11.5592799Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 44.8 MB/s eta 0:00:00 2025-05-07T19:48:11.5593258Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:11.5594065Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:11.5594953Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 48.5 MB/s eta 0:00:00 2025-05-07T19:48:11.5595424Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:11.5596326Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:11.5597332Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 47.3 MB/s eta 0:00:00 2025-05-07T19:48:11.5597759Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:11.5600232Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:11.5601194Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 70.4 MB/s eta 0:00:00 2025-05-07T19:48:11.5601744Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:11.5602510Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:11.5603330Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 61.6 MB/s eta 0:00:00 2025-05-07T19:48:11.5603799Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:11.5604583Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:11.5605408Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 80.5 MB/s eta 0:00:00 2025-05-07T19:48:11.5605861Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:11.5606600Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:11.5607441Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 72.4 MB/s eta 0:00:00 2025-05-07T19:48:11.5607877Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:11.5608611Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:11.5609577Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 68.7 MB/s eta 0:00:00 2025-05-07T19:48:11.5609967Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:11.5610806Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:11.5611660Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:11.5612355Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:11.5613098Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:11.5613917Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:11.5614843Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 50.2 MB/s eta 0:00:00 2025-05-07T19:48:11.5615242Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:11.5616115Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:11.5616997Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:11.5617883Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:11.5619266Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:48:11.5620205Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:11.5620782Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:11.5621473Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 3.8 MB/s eta 0:00:00 2025-05-07T19:48:11.5622269Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:11.5623431Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp311-cp311-manylinux_2_28_x86_64.whl (825.6 MB) 2025-05-07T19:48:11.5624386Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.6/825.6 MB 30.5 MB/s eta 0:00:00 2025-05-07T19:48:11.5625206Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:11.5626126Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 28.1 MB/s eta 0:00:00 2025-05-07T19:48:11.5626920Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:11.5627839Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 60.0 MB/s eta 0:00:00 2025-05-07T19:48:11.5628792Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:48:11.5629992Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 73.1 MB/s eta 0:00:00 2025-05-07T19:48:11.5631941Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:11.5633684Z 2025-05-07T19:48:11.5635869Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:11.5638155Z 2025-05-07T19:48:13.7129912Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:13.7131276Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:16.9112559Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:20.1075151Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:20.1075714Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:23.2715799Z True 2025-05-07T19:48:23.3466369Z True 2025-05-07T19:48:23.3466569Z 2025-05-07T19:48:23.3466864Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:23.3547151Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:23.3547842Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:23.3548518Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:23.3548869Z env: 2025-05-07T19:48:23.3549098Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:23.3549580Z BUILD_ENV: build_binary 2025-05-07T19:48:23.3550040Z BUILD_TARGET: default 2025-05-07T19:48:23.3550339Z BUILD_VARIANT: cuda 2025-05-07T19:48:23.3550608Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:23.3550876Z ##[endgroup] 2025-05-07T19:48:23.8046266Z /github/home/miniconda/bin/conda 2025-05-07T19:48:23.8047252Z ################################################################################ 2025-05-07T19:48:23.8048396Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:23.8048783Z # 2025-05-07T19:48:23.8068038Z # [2025-05-07T19:48:23.806Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:23.8068495Z ################################################################################ 2025-05-07T19:48:23.8068731Z 2025-05-07T19:48:23.8089423Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:23.8994573Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:23.8999611Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:23.9000561Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:23.9001028Z 2025-05-07T19:48:23.9838609Z 2025-05-07T19:48:23.9839344Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:23.9862081Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:29.3007685Z Collecting environment information... 2025-05-07T19:48:29.3008761Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:29.3009687Z Is debug build: False 2025-05-07T19:48:29.3011087Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:29.3011942Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:29.3012471Z 2025-05-07T19:48:29.3012776Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:29.3013753Z GCC version: (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:48:29.3014742Z Clang version: Could not collect 2025-05-07T19:48:29.3015550Z CMake version: version 4.0.2 2025-05-07T19:48:29.3016336Z Libc version: glibc-2.34 2025-05-07T19:48:29.3016793Z 2025-05-07T19:48:29.3017765Z Python version: 3.11.11 | packaged by conda-forge | (main, Mar 3 2025, 20:43:55) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:29.3018683Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:29.3019121Z Is CUDA available: False 2025-05-07T19:48:29.3019400Z CUDA runtime version: 12.6.85 2025-05-07T19:48:29.3019678Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:29.3020127Z GPU models and configuration: Could not collect 2025-05-07T19:48:29.3020491Z Nvidia driver version: Could not collect 2025-05-07T19:48:29.3020791Z cuDNN version: Could not collect 2025-05-07T19:48:29.3021076Z HIP runtime version: N/A 2025-05-07T19:48:29.3021326Z MIOpen runtime version: N/A 2025-05-07T19:48:29.3021608Z Is XNNPACK available: True 2025-05-07T19:48:29.3021767Z 2025-05-07T19:48:29.3021846Z CPU: 2025-05-07T19:48:29.3022081Z Architecture: x86_64 2025-05-07T19:48:29.3022417Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:29.3022822Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:29.3023235Z Byte Order: Little Endian 2025-05-07T19:48:29.3023553Z CPU(s): 96 2025-05-07T19:48:29.3023868Z On-line CPU(s) list: 0-95 2025-05-07T19:48:29.3024186Z Vendor ID: GenuineIntel 2025-05-07T19:48:29.3024807Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:29.3025207Z CPU family: 6 2025-05-07T19:48:29.3025508Z Model: 85 2025-05-07T19:48:29.3025796Z Thread(s) per core: 2 2025-05-07T19:48:29.3026106Z Core(s) per socket: 24 2025-05-07T19:48:29.3026410Z Socket(s): 2 2025-05-07T19:48:29.3026684Z Stepping: 7 2025-05-07T19:48:29.3026998Z BogoMIPS: 6000.01 2025-05-07T19:48:29.3029453Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:29.3032164Z Hypervisor vendor: KVM 2025-05-07T19:48:29.3032529Z Virtualization type: full 2025-05-07T19:48:29.3032910Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:29.3033306Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:29.3033718Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:29.3034102Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:29.3034466Z NUMA node(s): 2 2025-05-07T19:48:29.3034789Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:29.3035164Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:29.3035666Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:29.3036370Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:29.3037002Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:29.3037623Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:29.3038246Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:29.3038890Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:29.3039511Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:29.3039913Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:29.3040296Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:29.3040701Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:29.3041302Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:29.3042259Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:29.3042912Z Vulnerability Srbds: Not affected 2025-05-07T19:48:29.3043275Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:29.3043527Z 2025-05-07T19:48:29.3043631Z Versions of relevant libraries: 2025-05-07T19:48:29.3043895Z [pip3] numpy==2.2.5 2025-05-07T19:48:29.3044154Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:29.3044470Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:29.3044772Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:29.3045098Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:29.3045406Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:29.3045704Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:29.3045986Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:29.3046299Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:29.3046598Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:29.3047020Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:29.3047313Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:29.3047613Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:29.3047929Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:29.3048213Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:29.3048537Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:29.3048906Z [conda] cuda-cudart 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:29.3049420Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:29.3049942Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:29.3050486Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:29.3051045Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:29.3051585Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:29.3052090Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3052562Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:29.3053064Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:29.3053565Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:29.3054061Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3054543Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:29.3055007Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3055479Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3055956Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:29.3056460Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:29.3057019Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:29.3057489Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:29.3057975Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3058437Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:29.3058919Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3059390Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:29.3059887Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:29.3060387Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:29.3060878Z [conda] libcusparse 12.5.4.2 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3061385Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:29.3061878Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:29.3062391Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:29.3062887Z [conda] numpy 2.2.5 py311h5d046bc_0 conda-forge 2025-05-07T19:48:29.3063347Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:29.3063861Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:29.3064359Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:29.3064879Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:29.3065373Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:29.3065946Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:29.3066444Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:29.3066929Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:29.3067439Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:29.3067938Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:29.3068440Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:29.3068929Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:29.3069920Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:29.3070517Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:29.3071024Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:29.3071330Z 2025-05-07T19:48:29.3814053Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:29.3814713Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:29.3815311Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:29.3815670Z env: 2025-05-07T19:48:29.3815944Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:29.3816265Z BUILD_ENV: build_binary 2025-05-07T19:48:29.3816550Z BUILD_TARGET: default 2025-05-07T19:48:29.3816799Z BUILD_VARIANT: cuda 2025-05-07T19:48:29.3817068Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:29.3817334Z ##[endgroup] 2025-05-07T19:48:29.8332039Z ################################################################################ 2025-05-07T19:48:29.8333095Z # Install cuDNN 2025-05-07T19:48:29.8333737Z # 2025-05-07T19:48:29.8353080Z # [2025-05-07T19:48:29.834Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:29.8354052Z ################################################################################ 2025-05-07T19:48:29.8354296Z 2025-05-07T19:48:29.8373853Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:29.9249956Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:29.9250546Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:29.9251064Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:29.9251319Z 2025-05-07T19:48:29.9266643Z 2025-05-07T19:48:29.9267367Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:29.9267727Z 2025-05-07T19:48:29.9281756Z 2025-05-07T19:48:29.9302244Z [INSTALL] Downloading cuDNN to /tmp/tmp.w6ZUCxSlGw ... 2025-05-07T19:48:29.9323352Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:35.3965935Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:48:35.3966385Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:35.3966571Z 2025-05-07T19:48:35.3997632Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:35.3998079Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:35.3998542Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:40.0810028Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:40.1448831Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:48:47.7665713Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:48:48.0095808Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:48:48.0471514Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:48:48.5820477Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:48:50.6752204Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:48:50.6753902Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:48:50.6755593Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:48:50.6756479Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:48:50.6757091Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:48:50.6757623Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:48:50.6758186Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:48:50.6758688Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:48:50.6759136Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:48:50.6759638Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 2025-05-07T19:48:50.6760849Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:48:50.6761890Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:48:50.6762462Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:48:55.2036229Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:48:55.2037834Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:48:55.2656620Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:48:55.2657975Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:02.4660920Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:02.4662760Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:02.4664589Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:02.4666556Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:02.6621288Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:02.6623125Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:02.6624606Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:02.6626143Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:02.6985572Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:03.2456117Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:03.2457780Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:03.2459086Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:03.2459587Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:03.2460119Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:05.3861496Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:05.3862869Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:05.3864363Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:05.3865863Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:05.3867071Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:05.3867624Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:05.3868149Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:05.3868708Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:05.3869321Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:05.3869838Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:05.3870462Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:05.3870985Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:05.3871514Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:05.3872019Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:05.3872551Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:05.3873012Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:05.3880984Z 2025-05-07T19:49:05.3882049Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:49:05.3883525Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:05.3884268Z 2025-05-07T19:49:05.3901818Z 2025-05-07T19:49:05.3903257Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:05.3904064Z 2025-05-07T19:49:05.3912549Z 2025-05-07T19:49:05.3913643Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:05.3914819Z 2025-05-07T19:49:05.3942271Z 2025-05-07T19:49:05.3943823Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:05.3945017Z 2025-05-07T19:49:06.7187580Z 2025-05-07T19:49:06.7187966Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:06.7189010Z + rm -rf /tmp/tmp.w6ZUCxSlGw 2025-05-07T19:49:06.7189393Z 2025-05-07T19:49:07.1509480Z 2025-05-07T19:49:07.1520053Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:07.1521066Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:07.1521765Z 2025-05-07T19:49:07.5682526Z 2025-05-07T19:49:07.5683473Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:07.5754004Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:07.5754644Z . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:07.5755446Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:07.5755793Z env: 2025-05-07T19:49:07.5756022Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:07.5756351Z BUILD_ENV: build_binary 2025-05-07T19:49:07.5756603Z BUILD_TARGET: default 2025-05-07T19:49:07.5756859Z BUILD_VARIANT: cuda 2025-05-07T19:49:07.5757111Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:07.5757361Z ##[endgroup] 2025-05-07T19:49:07.9621150Z ################################################################################ 2025-05-07T19:49:07.9621604Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:07.9621894Z # 2025-05-07T19:49:07.9634106Z # [2025-05-07T19:49:07.963Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:07.9634687Z ################################################################################ 2025-05-07T19:49:07.9634938Z 2025-05-07T19:49:07.9653177Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:08.0577417Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:08.0592042Z [BUILD] Running git submodules update ... 2025-05-07T19:49:08.0623520Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:08.0952277Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:08.0952847Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:08.0953371Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:08.0953842Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:08.0954298Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:08.0954795Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:08.0955245Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:08.0989665Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:08.1431932Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:08.1464042Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:10.2152002Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:10.2366560Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:10.2468892Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:10.3696576Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:10.3741247Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:10.3818451Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:10.3819899Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:10.3822394Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:10.3827587Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:10.4140719Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:10.4180484Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:10.4250193Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:10.4406968Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:10.4451179Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:10.4517151Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:10.4518909Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:10.4528979Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:10.4745184Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:10.4786638Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:10.4980276Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:10.5018713Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:10.5281313Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:10.5321331Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:10.5411842Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:10.5416236Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:10.5458213Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:10.5462831Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:10.5513880Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:10.5646885Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:10.5685055Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:10.5748998Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:10.5762006Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:10.5777427Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:10.6052690Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:10.6089014Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:10.6207497Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:10.6298362Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:10.7504172Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 240.8 MB/s eta 0:00:00 2025-05-07T19:49:10.7569251Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:10.7659094Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:10.7726947Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:10.7794866Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:10.7892538Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:10.7994721Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:10.8071553Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:10.9513593Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:11.7916184Z 2025-05-07T19:49:11.7945885Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:11.7948476Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:11.9496959Z ################################################################################ 2025-05-07T19:49:11.9497482Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:11.9497811Z # 2025-05-07T19:49:11.9512249Z # [2025-05-07T19:49:11.950Z] + install_triton_pip build_binary 2025-05-07T19:49:11.9512706Z ################################################################################ 2025-05-07T19:49:11.9512953Z 2025-05-07T19:49:11.9513227Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:11.9513703Z ################################################################################ 2025-05-07T19:49:11.9514127Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:11.9514485Z # 2025-05-07T19:49:11.9531644Z # [2025-05-07T19:49:11.952Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:11.9532308Z ################################################################################ 2025-05-07T19:49:11.9532561Z 2025-05-07T19:49:11.9550695Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:12.0406449Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:12.0407627Z ################################################################################ 2025-05-07T19:49:12.0408674Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:12.0409577Z # 2025-05-07T19:49:12.0433317Z # [2025-05-07T19:49:12.042Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:12.0434940Z ################################################################################ 2025-05-07T19:49:12.0435652Z 2025-05-07T19:49:12.0487404Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:12.0501503Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:12.0502618Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:12.0505989Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:12.0517051Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:12.0543609Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:17.4729580Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:17.4730580Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:17.4731015Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:17.4732286Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:17.4733616Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:17.4734982Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 186.2 MB/s eta 0:00:00 2025-05-07T19:49:17.4735389Z Installing collected packages: pytorch-triton 2025-05-07T19:49:17.4735734Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:17.4736655Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:17.4738709Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:17.4740133Z 2025-05-07T19:49:17.4740331Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:17.4740769Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:17.4741181Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:17.4741637Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:17.4741897Z 2025-05-07T19:49:19.5833347Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:19.5834606Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:21.5842118Z ################################################################################ 2025-05-07T19:49:21.5843454Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:21.5844678Z ################################################################################ 2025-05-07T19:49:21.5845410Z 2025-05-07T19:49:23.5559966Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:25.5747984Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:25.5748613Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:25.5825663Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:25.5826443Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:25.5827095Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:25.5827472Z env: 2025-05-07T19:49:25.5827707Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:25.5828059Z BUILD_ENV: build_binary 2025-05-07T19:49:25.5828327Z BUILD_TARGET: default 2025-05-07T19:49:25.5828622Z BUILD_VARIANT: cuda 2025-05-07T19:49:25.5828902Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:25.5829272Z ##[endgroup] 2025-05-07T19:49:26.0374103Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:26.0375225Z [BUILD] Extracted build target: default 2025-05-07T19:49:26.0376202Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:27.8327610Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:27.8328588Z 2025-05-07T19:49:27.8908672Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:29.6687072Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:29.6687389Z 2025-05-07T19:49:29.7471714Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:31.5407193Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:31.5407631Z 2025-05-07T19:49:31.6154998Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:33.4137791Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:33.4138687Z 2025-05-07T19:49:33.4890414Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:35.3323973Z [BUILD] Extracted and set Python tag: py311 2025-05-07T19:49:35.3324534Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:35.3559213Z core = 24 2025-05-07T19:49:35.3780561Z sockets = 2 2025-05-07T19:49:35.3781014Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:35.3781439Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:35.3782194Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:49:35.3782560Z + rm -rf dist 2025-05-07T19:49:35.3782746Z 2025-05-07T19:49:35.3793475Z 2025-05-07T19:49:35.3794262Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:35.3795242Z 2025-05-07T19:49:38.4237222Z INFO:root:running clean 2025-05-07T19:49:38.4237733Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:38.4238867Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:38.4240020Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:38.4240576Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:38.4241181Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:38.4241836Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:38.4242512Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:38.4242986Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:38.4244348Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:38.7851650Z 2025-05-07T19:49:38.7852516Z [BUILD] Printing git status ... 2025-05-07T19:49:38.7853464Z + git status 2025-05-07T19:49:38.7853842Z 2025-05-07T19:49:39.4850164Z HEAD detached at pull/4066/merge 2025-05-07T19:49:39.4851133Z Untracked files: 2025-05-07T19:49:39.4852061Z (use "git add ..." to include in what will be committed) 2025-05-07T19:49:39.4852611Z ../build_only/ 2025-05-07T19:49:39.4852878Z ../collect_env.py 2025-05-07T19:49:39.4853177Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:39.4853394Z 2025-05-07T19:49:39.4854037Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:39.4854417Z 2025-05-07T19:49:39.4854510Z + git diff 2025-05-07T19:49:39.4854781Z 2025-05-07T19:49:39.5139489Z 2025-05-07T19:49:39.5140142Z ################################################################################ 2025-05-07T19:49:39.5140830Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:39.5141135Z # 2025-05-07T19:49:39.5155761Z # [2025-05-07T19:49:39.515Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:39.5156979Z ################################################################################ 2025-05-07T19:49:39.5157710Z 2025-05-07T19:49:39.5160934Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:39.5162260Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 2025-05-07T19:49:41.3549581Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:41.3549985Z 2025-05-07T19:49:41.4308379Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:43.2694673Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:43.2695014Z 2025-05-07T19:49:43.3441115Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:49:45.1862367Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:45.1862757Z 2025-05-07T19:49:45.2594611Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:49:47.0946799Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:49:47.0947903Z 2025-05-07T19:49:47.1633986Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:49:49.0655315Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:49:49.0655940Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:49:49.0656332Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:49:49.0656683Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:49:49.0657495Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:49:49.0657953Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:49:49.0658363Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:49:50.9425483Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:49:54.7868575Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:49:54.7870227Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:49:54.7871074Z 2025-05-07T19:49:55.1945557Z 2025-05-07T19:49:55.1946151Z [BUILD] Setting CUDA build args ... 2025-05-07T19:49:57.0219915Z [BUILD] Looking up CUDA version ... 2025-05-07T19:50:00.6741723Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:00.6742609Z 2025-05-07T19:50:02.5019078Z 2025-05-07T19:50:02.5019820Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:02.5022460Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:02.5024820Z 2025-05-07T19:50:02.9057081Z 2025-05-07T19:50:02.9057855Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:02.9058756Z 2025-05-07T19:50:04.6736162Z -std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:04.6737817Z 2025-05-07T19:50:04.7312578Z 2025-05-07T19:50:04.7313710Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:04.7314097Z + conda run -n build_binary c++ --version 2025-05-07T19:50:04.7314336Z 2025-05-07T19:50:06.5202616Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:50:06.5203164Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:50:06.5203671Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:50:06.5204304Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:50:06.5204672Z 2025-05-07T19:50:06.5204676Z 2025-05-07T19:50:06.5969812Z 2025-05-07T19:50:06.5971600Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:06.5972061Z 2025-05-07T19:50:08.4512001Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:08.4513210Z 2025-05-07T19:50:08.4513681Z [BUILD] Enabling debug features in the build ... 2025-05-07T19:50:08.4516775Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --debug 2025-05-07T19:50:08.4519104Z ################################################################################ 2025-05-07T19:50:08.4519536Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:08.4519824Z # 2025-05-07T19:50:08.4531205Z # [2025-05-07T19:50:08.452Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:08.4532189Z ################################################################################ 2025-05-07T19:50:08.4532443Z 2025-05-07T19:50:08.4532661Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 2025-05-07T19:50:08.4537634Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py311 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:08.4542601Z 2025-05-07T19:50:10.2831449Z * Getting build dependencies for wheel... 2025-05-07T19:50:11.5617928Z INFO:root:running egg_info 2025-05-07T19:50:11.5638943Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:11.5644094Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:11.5645038Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:11.5645785Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:11.5646529Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:11.5647152Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:11.5702306Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:11.5711599Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:11.5715453Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:11.5716597Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:11.5717719Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:11.5718286Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:11.5718912Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:11.5719811Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:11.5720447Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:11.5720891Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:11.5722246Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:11.8988387Z * Building wheel... 2025-05-07T19:50:13.1770967Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-w8ozr4b7', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--debug', '--package_channel=nightly', '--python-tag=py311', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:13.1799451Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:13.1803037Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-w8ozr4b7', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--python-tag=py311', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:13.1805243Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:13.1805896Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:13.1806510Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:13.1807141Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:13.1807577Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:13.1812584Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:13.1817665Z 2025-05-07T19:50:13.1817673Z 2025-05-07T19:50:13.1817978Z -------------------------------------------------------------------------------- 2025-05-07T19:50:13.1818370Z -- Trying 'Ninja' generator 2025-05-07T19:50:13.1818686Z -------------------------------- 2025-05-07T19:50:13.1818962Z --------------------------- 2025-05-07T19:50:13.1819251Z ---------------------- 2025-05-07T19:50:13.1819493Z ----------------- 2025-05-07T19:50:13.1819759Z ------------ 2025-05-07T19:50:13.1820010Z ------- 2025-05-07T19:50:13.1820225Z -- 2025-05-07T19:50:13.2239060Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:13.2240804Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:13.2242079Z CMake. 2025-05-07T19:50:13.2242423Z 2025-05-07T19:50:13.2243089Z Update the VERSION argument value. Or, use the ... syntax 2025-05-07T19:50:13.2244776Z to tell CMake that the project requires at least but has been updated 2025-05-07T19:50:13.2246228Z to work with policies introduced by or earlier. 2025-05-07T19:50:13.2247031Z 2025-05-07T19:50:13.2247057Z 2025-05-07T19:50:13.2247605Z Not searching for unused variables given on the command line. 2025-05-07T19:50:13.2697001Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:13.2772792Z -- Detecting C compiler ABI info 2025-05-07T19:50:13.3648300Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:13.3825828Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:13.3826530Z -- Detecting C compile features 2025-05-07T19:50:13.3829066Z -- Detecting C compile features - done 2025-05-07T19:50:13.4620106Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:13.4690799Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:13.5643397Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:13.5833133Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:13.5835763Z -- Detecting CXX compile features 2025-05-07T19:50:13.5841783Z -- Detecting CXX compile features - done 2025-05-07T19:50:13.5908940Z -- Configuring done (0.4s) 2025-05-07T19:50:13.5952404Z -- Generating done (0.0s) 2025-05-07T19:50:13.5968392Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:13.6009529Z -- 2025-05-07T19:50:13.6009927Z ------- 2025-05-07T19:50:13.6010677Z ------------ 2025-05-07T19:50:13.6010999Z ----------------- 2025-05-07T19:50:13.6011295Z ---------------------- 2025-05-07T19:50:13.6011564Z --------------------------- 2025-05-07T19:50:13.6011877Z -------------------------------- 2025-05-07T19:50:13.6012191Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:13.6012629Z -------------------------------------------------------------------------------- 2025-05-07T19:50:13.6012935Z 2025-05-07T19:50:13.6025056Z Configuring Project 2025-05-07T19:50:13.6025920Z Working directory: 2025-05-07T19:50:13.6026465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 2025-05-07T19:50:13.6026918Z Command: 2025-05-07T19:50:13.6046608Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install -DPYTHON_VERSION_STRING:STRING=3.11.11 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.11.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:50:13.6065169Z 2025-05-07T19:50:13.6433180Z 2025-05-07T19:50:13.6433200Z 2025-05-07T19:50:13.6433782Z ================================================================================ 2025-05-07T19:50:13.6434907Z Default C compiler flags 2025-05-07T19:50:13.6436009Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:13.6436787Z 2025-05-07T19:50:13.6437528Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 2025-05-07T19:50:13.6438316Z ================================================================================ 2025-05-07T19:50:13.6438568Z 2025-05-07T19:50:13.6438572Z 2025-05-07T19:50:13.6438576Z 2025-05-07T19:50:13.6438701Z ================================================================================ 2025-05-07T19:50:13.6439090Z Default C++ compiler flags 2025-05-07T19:50:13.6439490Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:13.6439846Z 2025-05-07T19:50:13.6440324Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 2025-05-07T19:50:13.6441070Z ================================================================================ 2025-05-07T19:50:13.6441317Z 2025-05-07T19:50:13.6441519Z Not searching for unused variables given on the command line. 2025-05-07T19:50:13.6441909Z 2025-05-07T19:50:13.6441918Z 2025-05-07T19:50:13.6442046Z ================================================================================ 2025-05-07T19:50:13.6442423Z AVX2_FLAGS: 2025-05-07T19:50:13.6442561Z 2025-05-07T19:50:13.6442653Z -mavx2 2025-05-07T19:50:13.6442896Z -mf16c 2025-05-07T19:50:13.6443105Z -mfma 2025-05-07T19:50:13.6443347Z -fopenmp 2025-05-07T19:50:13.6443601Z ================================================================================ 2025-05-07T19:50:13.6443879Z 2025-05-07T19:50:13.6443883Z 2025-05-07T19:50:13.6443887Z 2025-05-07T19:50:13.6444005Z ================================================================================ 2025-05-07T19:50:13.6444373Z AVX512_FLAGS: 2025-05-07T19:50:13.6444510Z 2025-05-07T19:50:13.6444600Z -mavx2 2025-05-07T19:50:13.6444841Z -mf16c 2025-05-07T19:50:13.6445047Z -mfma 2025-05-07T19:50:13.6445290Z -mavx512f 2025-05-07T19:50:13.6445508Z -mavx512bw 2025-05-07T19:50:13.6445759Z -mavx512dq 2025-05-07T19:50:13.6445976Z -mavx512vl 2025-05-07T19:50:13.6446226Z -fopenmp 2025-05-07T19:50:13.6446591Z ================================================================================ 2025-05-07T19:50:13.6446864Z 2025-05-07T19:50:13.6446868Z 2025-05-07T19:50:13.6446871Z 2025-05-07T19:50:13.6446995Z ================================================================================ 2025-05-07T19:50:13.6447391Z The project is built using scikit-build 2025-05-07T19:50:13.6447742Z ================================================================================ 2025-05-07T19:50:13.6448018Z 2025-05-07T19:50:13.6448022Z 2025-05-07T19:50:13.6448025Z 2025-05-07T19:50:13.6448147Z ================================================================================ 2025-05-07T19:50:13.6448515Z Build Settings 2025-05-07T19:50:13.6448656Z 2025-05-07T19:50:13.6448773Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:13.6449116Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:13.6449308Z 2025-05-07T19:50:13.6449415Z NVCC_VERBOSE : 2025-05-07T19:50:13.6449832Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:13.6450104Z CUDNN_LIBRARY : 2025-05-07T19:50:13.6450579Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:13.6451112Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:13.6451387Z 8.0 2025-05-07T19:50:13.6451615Z 9.0 2025-05-07T19:50:13.6451819Z 9.0a 2025-05-07T19:50:13.6451942Z 2025-05-07T19:50:13.6452077Z HIP_ROOT_DIR : 2025-05-07T19:50:13.6452347Z HIPCC_VERBOSE : 2025-05-07T19:50:13.6452644Z AMDGPU_TARGETS : 2025-05-07T19:50:13.6452921Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:13.6453246Z ================================================================================ 2025-05-07T19:50:13.6453485Z 2025-05-07T19:50:13.7232428Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:13.7624033Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:14.6945279Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler GNU 11.4.0 2025-05-07T19:50:14.7037862Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:14.8023349Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:14.8214763Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:14.8215918Z -- Detecting CXX compile features 2025-05-07T19:50:14.8223623Z -- Detecting CXX compile features - done 2025-05-07T19:50:14.8338926Z -- Detecting C compiler ABI info 2025-05-07T19:50:14.9208877Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:14.9388658Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:14.9390820Z -- Detecting C compile features 2025-05-07T19:50:14.9394017Z -- Detecting C compile features - done 2025-05-07T19:50:14.9494447Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:15.8708311Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:15.9271076Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:15.9290946Z -- Detecting CUDA compile features 2025-05-07T19:50:15.9293634Z -- Detecting CUDA compile features - done 2025-05-07T19:50:15.9369523Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:16.1914489Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:16.1915314Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:16.4647203Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:16.4648239Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:16.7201212Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:16.7203636Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:16.9889392Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:16.9890481Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:17.2450083Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:17.2451180Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:17.4619579Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:17.4620673Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:17.7169204Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:17.7170288Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:17.9922294Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:17.9923345Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:18.2485692Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:18.2486216Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:18.5171589Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:18.5172687Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:18.7741686Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:18.7742760Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:18.9915627Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:19.0090299Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:19.0128091Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:19.0206827Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:19.1089775Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed 2025-05-07T19:50:19.1091008Z -- Looking for pthread_create in pthreads 2025-05-07T19:50:19.1866012Z -- Looking for pthread_create in pthreads - not found 2025-05-07T19:50:19.1866624Z -- Looking for pthread_create in pthread 2025-05-07T19:50:19.2752000Z -- Looking for pthread_create in pthread - found 2025-05-07T19:50:19.2762924Z -- Found Threads: TRUE 2025-05-07T19:50:19.4363377Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:19.4363998Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:19.4364792Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:19.5570477Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:19.6352576Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.11.11") found components: Interpreter 2025-05-07T19:50:19.6370233Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:19.6372884Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:19.6374287Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:19.6375702Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:19.6376713Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:19.6377157Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:19.6377523Z Call Stack (most recent call first): 2025-05-07T19:50:19.6378205Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:19.6379315Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:19.6380158Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:19.6380622Z CMakeLists.txt:112 (include) 2025-05-07T19:50:19.6380805Z 2025-05-07T19:50:19.6380809Z 2025-05-07T19:50:19.6381391Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:19.6705273Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:19.6706238Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:50:19.6706738Z Call Stack (most recent call first): 2025-05-07T19:50:19.6707738Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:19.6708781Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:19.6709733Z CMakeLists.txt:112 (include) 2025-05-07T19:50:19.6709932Z 2025-05-07T19:50:19.6710003Z 2025-05-07T19:50:19.6710466Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:19.6710998Z 2025-05-07T19:50:19.6711002Z 2025-05-07T19:50:19.6711155Z ================================================================================ 2025-05-07T19:50:19.6711503Z PyTorch Flags: 2025-05-07T19:50:19.6711762Z 2025-05-07T19:50:19.6711979Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:19.6712460Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:19.6713291Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:19.6713941Z 2025-05-07T19:50:19.6714173Z TORCH_LIBRARIES: 2025-05-07T19:50:19.6714439Z torch 2025-05-07T19:50:19.6714682Z torch_library 2025-05-07T19:50:19.6715146Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:19.6715895Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:19.6716622Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:19.6717206Z 2025-05-07T19:50:19.6717442Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:19.6717737Z --expt-relaxed-constexpr 2025-05-07T19:50:19.6718060Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:19.6718368Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:19.6718710Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:19.6719024Z ================================================================================ 2025-05-07T19:50:19.6719297Z 2025-05-07T19:50:19.6719301Z 2025-05-07T19:50:19.6719305Z 2025-05-07T19:50:19.6719430Z ================================================================================ 2025-05-07T19:50:19.6719769Z NCCL Flags 2025-05-07T19:50:19.6719931Z 2025-05-07T19:50:19.6720335Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:19.6721591Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:19.6722354Z ================================================================================ 2025-05-07T19:50:19.6722614Z 2025-05-07T19:50:19.6722618Z 2025-05-07T19:50:19.6722621Z 2025-05-07T19:50:19.6722740Z ================================================================================ 2025-05-07T19:50:19.6723096Z CUDA Driver Path 2025-05-07T19:50:19.6723236Z 2025-05-07T19:50:19.6723593Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:19.6724202Z ================================================================================ 2025-05-07T19:50:19.6724427Z 2025-05-07T19:50:19.6724716Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:19.6741093Z 2025-05-07T19:50:19.6741164Z 2025-05-07T19:50:19.6741575Z ================================================================================ 2025-05-07T19:50:19.6742069Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:19.6742390Z 2025-05-07T19:50:19.6742720Z CPU_SRCS: 2025-05-07T19:50:19.6742847Z 2025-05-07T19:50:19.6742939Z 2025-05-07T19:50:19.6743177Z GPU_SRCS: 2025-05-07T19:50:19.6743302Z 2025-05-07T19:50:19.6743421Z 2025-05-07T19:50:19.6743631Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:19.6743789Z 2025-05-07T19:50:19.6743903Z 2025-05-07T19:50:19.6744119Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:19.6744294Z 2025-05-07T19:50:19.6744379Z 2025-05-07T19:50:19.6744580Z OTHER_SRCS: 2025-05-07T19:50:19.6745011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:19.6745655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:19.6746317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:19.6747203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:19.6747845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:19.6748492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:19.6749099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:19.6749880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:19.6750523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:19.6751139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:19.6751795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:19.6752428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:19.6753091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:19.6753714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:19.6754365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:19.6755029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:19.6755655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:19.6756294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:19.6756908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:19.6757559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:19.6758183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:19.6758842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:19.6759639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:19.6760300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:19.6760976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:19.6761588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:19.6762260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:19.6762941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:19.6763538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:19.6764157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:19.6764779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:19.6765444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:19.6766067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:19.6766806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:19.6767421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:19.6768003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:19.6768609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:19.6769193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:19.6769799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:19.6770379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:19.6770986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:19.6771651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:19.6772226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:19.6772881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:19.6773485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:19.6774080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:19.6774714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:19.6775343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:19.6775945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:19.6776593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:19.6777204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:19.6777843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:19.6778490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:19.6779110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:19.6779736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:19.6780329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:19.6780951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:19.6781547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:19.6782169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:19.6782643Z 2025-05-07T19:50:19.6782845Z CC_FLAGS: 2025-05-07T19:50:19.6782972Z 2025-05-07T19:50:19.6783099Z 2025-05-07T19:50:19.6783305Z NVCC_FLAGS: 2025-05-07T19:50:19.6783467Z 2025-05-07T19:50:19.6783633Z 2025-05-07T19:50:19.6783840Z HIPCC_FLAGS: 2025-05-07T19:50:19.6784006Z 2025-05-07T19:50:19.6784091Z 2025-05-07T19:50:19.6784294Z INCLUDE_DIRS: 2025-05-07T19:50:19.6784797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:19.6785317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:19.6785722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:19.6786085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:19.6786618Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:19.6787465Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:19.6788150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:19.6788615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:19.6789077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:19.6789707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:19.6790285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:19.6790775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:19.6791399Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:19.6791939Z 2025-05-07T19:50:19.6792194Z Selected Source Files: 2025-05-07T19:50:19.6792360Z 2025-05-07T19:50:19.6792448Z 2025-05-07T19:50:19.6792694Z HIPified Source Files: 2025-05-07T19:50:19.6792858Z 2025-05-07T19:50:19.6792973Z 2025-05-07T19:50:19.6793190Z Library Dependencies: 2025-05-07T19:50:19.6793470Z torch 2025-05-07T19:50:19.6793677Z torch_library 2025-05-07T19:50:19.6794163Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:19.6794867Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:19.6795794Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:19.6796623Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:19.6797423Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:19.6798085Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:19.6798508Z 2025-05-07T19:50:19.6798739Z Output Library: 2025-05-07T19:50:19.6798973Z asmjit 2025-05-07T19:50:19.6799201Z 2025-05-07T19:50:19.6799422Z Destination Directory: 2025-05-07T19:50:19.6799709Z fbgemm_gpu 2025-05-07T19:50:19.6799967Z ================================================================================ 2025-05-07T19:50:19.6800251Z 2025-05-07T19:50:19.6800255Z 2025-05-07T19:50:19.6800261Z 2025-05-07T19:50:19.6800388Z ================================================================================ 2025-05-07T19:50:19.6800799Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:19.6801124Z 2025-05-07T19:50:19.6801373Z CPU_SRCS: 2025-05-07T19:50:19.6801502Z 2025-05-07T19:50:19.6801618Z 2025-05-07T19:50:19.6801821Z GPU_SRCS: 2025-05-07T19:50:19.6802093Z 2025-05-07T19:50:19.6802179Z 2025-05-07T19:50:19.6802499Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:19.6802676Z 2025-05-07T19:50:19.6802765Z 2025-05-07T19:50:19.6802971Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:19.6803144Z 2025-05-07T19:50:19.6803234Z 2025-05-07T19:50:19.6803466Z OTHER_SRCS: 2025-05-07T19:50:19.6803748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:19.6804228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:19.6804693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:19.6805143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:19.6805556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:19.6806067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:19.6806635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:19.6807088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:19.6807931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:19.6808384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:19.6808864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:19.6809313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:19.6809797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:19.6810191Z 2025-05-07T19:50:19.6810432Z CC_FLAGS: 2025-05-07T19:50:19.6810560Z 2025-05-07T19:50:19.6810651Z 2025-05-07T19:50:19.6810878Z NVCC_FLAGS: 2025-05-07T19:50:19.6811007Z 2025-05-07T19:50:19.6811122Z 2025-05-07T19:50:19.6811323Z HIPCC_FLAGS: 2025-05-07T19:50:19.6811460Z 2025-05-07T19:50:19.6811574Z 2025-05-07T19:50:19.6811778Z INCLUDE_DIRS: 2025-05-07T19:50:19.6812062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:19.6812400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:19.6812733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:19.6813066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:19.6813619Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:19.6814470Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:19.6815154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:19.6815625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:19.6816077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:19.6816609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:19.6817163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:19.6817689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:19.6818396Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:19.6818932Z 2025-05-07T19:50:19.6819178Z Selected Source Files: 2025-05-07T19:50:19.6819345Z 2025-05-07T19:50:19.6819435Z 2025-05-07T19:50:19.6819801Z HIPified Source Files: 2025-05-07T19:50:19.6819953Z 2025-05-07T19:50:19.6820040Z 2025-05-07T19:50:19.6820269Z Library Dependencies: 2025-05-07T19:50:19.6820512Z torch 2025-05-07T19:50:19.6820739Z torch_library 2025-05-07T19:50:19.6821172Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:19.6821859Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:19.6822566Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:19.6823338Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:19.6824081Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:19.6824555Z asmjit 2025-05-07T19:50:19.6824903Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:19.6825298Z 2025-05-07T19:50:19.6825518Z Output Library: 2025-05-07T19:50:19.6825765Z fbgemm 2025-05-07T19:50:19.6825954Z 2025-05-07T19:50:19.6826183Z Destination Directory: 2025-05-07T19:50:19.6826424Z fbgemm_gpu 2025-05-07T19:50:19.6826683Z ================================================================================ 2025-05-07T19:50:19.6826912Z 2025-05-07T19:50:19.6826916Z 2025-05-07T19:50:19.6826919Z 2025-05-07T19:50:19.6827036Z ================================================================================ 2025-05-07T19:50:19.6827395Z Running code generation script ... 2025-05-07T19:50:19.6828153Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:19.6828903Z ================================================================================ 2025-05-07T19:50:19.6829207Z 2025-05-07T19:50:20.2105597Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:20.2108329Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:20.2110842Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:20.2112269Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:20.2113765Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2115330Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:20.2116499Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:20.2116998Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:20.2117472Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:20.2118015Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2118531Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:20.2119046Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:20.2119583Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.2120099Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2120653Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2121208Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.2121768Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2122293Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.2122857Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2123428Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2124311Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.2124883Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2125385Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:20.2125842Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:20.2126220Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:20.2126678Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:20.2127212Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2127721Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:20.2128231Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:20.2128742Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2129289Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:20.2129802Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2130382Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2130976Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2131513Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2132095Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2132661Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2133206Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:20.2133650Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:20.2134080Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:20.2134565Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.2134977Z Written: lookup_adagrad.py 2025-05-07T19:50:20.2135454Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:20.2135870Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:20.2136355Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.2136838Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:20.2137326Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:20.2137828Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2138322Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:20.2138826Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:20.2139300Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:20.2139792Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:20.2140275Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2140806Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:20.2141307Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:20.2141783Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.2142296Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2142811Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2143380Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.2143901Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2144440Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.2144974Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2145497Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2146144Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.2146671Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2147187Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:20.2147606Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:20.2148011Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:20.2148478Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.2148869Z Written: lookup_adam.py 2025-05-07T19:50:20.2149203Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:20.2149729Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.2150426Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:20.2150938Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2151490Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:20.2151994Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:20.2152544Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2153104Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:20.2153616Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2154200Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2154772Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2155336Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2155901Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2156507Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2157057Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:20.2157511Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:20.2158040Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:20.2158510Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.2158965Z Written: lookup_lamb.py 2025-05-07T19:50:20.2159286Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:20.2159777Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.2160316Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:20.2160858Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2161438Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:20.2161957Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:20.2162633Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2163152Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:20.2163700Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2164279Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2164838Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2165396Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2165952Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2166549Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2167068Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:20.2167529Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:20.2167964Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:20.2168417Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.2168852Z Written: lookup_lars_sgd.py 2025-05-07T19:50:20.2169256Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:20.2169735Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.2170260Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:20.2170877Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2171506Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:20.2172081Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:20.2172710Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2173323Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:20.2173950Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2174592Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2175280Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2175941Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2176625Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2177287Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2977137Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:20.2978964Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:20.2980466Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:20.2982190Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.2983632Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:20.2984764Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:20.2985831Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.2986466Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:20.2987147Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.2987807Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:20.2988472Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:20.2989133Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.2989935Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:20.2990622Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.2991324Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.2992073Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.2992754Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.2993484Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.2994220Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.2994881Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:20.2995479Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:20.2996118Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:20.2996696Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.2997172Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:20.2997607Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:20.2998299Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.2998865Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:20.2999437Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:20.2999969Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:20.3000514Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:20.3001058Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.3001657Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.3002260Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:20.3002821Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:20.3003402Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:20.3003954Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:20.3004538Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:20.3005102Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:20.3005683Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:20.3006240Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:20.3006800Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.3007412Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.3007996Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:20.3008598Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:20.3009156Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:20.3009843Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:20.3010450Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.3011052Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.3011669Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:20.3012235Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.3012866Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.3013525Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.3014151Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.3014790Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.3015390Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:20.3015998Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.3016591Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.3017223Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.3017843Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:20.3018409Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.3019018Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.3019641Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.3020277Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.3020918Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.3021626Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:20.3022245Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.3022857Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:20.3023501Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:20.3024127Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:20.3024789Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:20.3025443Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:20.3026067Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:20.3026718Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:20.3027361Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:20.3028001Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:20.3028598Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:20.3029169Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:20.3030058Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:20.3030647Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:20.3031248Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:20.3031771Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:20.3032263Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:20.3032830Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.3033389Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:20.3033818Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:20.3034308Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:20.3034898Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.3035389Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:20.3035826Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:20.3036426Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:20.3036962Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.3037566Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:20.3038112Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:20.3038643Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:20.3039217Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.3039810Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:20.3040369Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.3041037Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:20.3041690Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:20.3042272Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:20.3042950Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.3043596Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:20.3044272Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.3045080Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:20.3045764Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:20.3046435Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:20.3047134Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.3047871Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:20.4018886Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4021265Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:20.4023259Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:20.4025152Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.4025865Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:20.4026551Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:20.4027213Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:20.4027891Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:20.4028587Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.4029275Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:20.4030336Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:20.4031059Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.4032079Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.4032861Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.4033636Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.4034409Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.4035142Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.4035907Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.4036775Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.4037503Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.4038232Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.4038904Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:20.4039525Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:20.4040070Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:20.4040699Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.4041250Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:20.4041720Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:20.4042360Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4043040Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:20.4043702Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:20.4044439Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:20.4045097Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.4045778Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:20.4046436Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4047122Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:20.4047685Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:20.4048232Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:20.4048834Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.4049417Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:20.4050022Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4050561Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:20.4051039Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:20.4051507Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.4052022Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:20.4052507Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:20.4052968Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:20.4053451Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:20.4053922Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.4054442Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:20.4054921Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:20.4055504Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.4056030Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.4056546Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.4057103Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:20.4057618Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.4058152Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.4058656Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.4059206Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.4059776Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:20.4060291Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.4060788Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:20.4061203Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:20.4061595Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:20.4062020Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.4062428Z Written: lookup_sgd.py 2025-05-07T19:50:20.4062747Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:20.4063120Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:20.4063560Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4064042Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:20.4064521Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:20.4064933Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:20.4065428Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.4065900Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:20.4066467Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4067163Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:20.4067664Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:20.4068215Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:20.4068704Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:20.4069239Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:20.4070040Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:20.4070602Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:20.4071195Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:20.4071770Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:20.4072344Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:20.4072919Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:20.4073540Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:20.4074068Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:20.4074557Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:20.4075001Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:20.4075470Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.4075916Z Written: lookup_none.py 2025-05-07T19:50:20.4076239Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:20.4076734Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.4077271Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:20.4077887Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:20.4078630Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:20.4079193Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:20.4079776Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:20.4080312Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:20.4080860Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:20.4081415Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:20.4082032Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:20.4082728Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:20.4083256Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:20.4083789Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:20.4084287Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:20.4085220Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:20.4085722Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:20.4086261Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:20.4086828Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:20.4087366Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:20.4087826Z Written: pt2_arg_utils.h 2025-05-07T19:50:20.4088096Z Written: __init__.py 2025-05-07T19:50:20.4088388Z Written: lookup_args_ssd.py 2025-05-07T19:50:20.4088673Z Written: lookup_args.py 2025-05-07T19:50:20.4138083Z 2025-05-07T19:50:20.4138130Z 2025-05-07T19:50:20.4138520Z ================================================================================ 2025-05-07T19:50:20.4138957Z Running code generation script ... 2025-05-07T19:50:20.4139845Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:20.4140952Z ================================================================================ 2025-05-07T19:50:20.4141229Z 2025-05-07T19:50:20.5143269Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:20.5144330Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:20.5145087Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:20.5145622Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:20.5146108Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:20.5146650Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:20.5147299Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:20.5147659Z Written: optimizer_args.py 2025-05-07T19:50:20.5231544Z 2025-05-07T19:50:20.5232102Z 2025-05-07T19:50:20.5232625Z ================================================================================ 2025-05-07T19:50:20.5233073Z Running code generation script ... 2025-05-07T19:50:20.5234027Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:20.5234838Z ================================================================================ 2025-05-07T19:50:20.5235081Z 2025-05-07T19:50:20.6372019Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:20.6374730Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:20.6376782Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:20.6377513Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:20.6378573Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:20.6379270Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:20.6379927Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:20.6380617Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:20.6381335Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:20.6382054Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:20.6382798Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:20.6383502Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:20.6384250Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:20.6385417Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:20.6386175Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:20.6386938Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:20.6387664Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:20.6388422Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:20.6389177Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:20.6390005Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:20.6390729Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:20.6391555Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:20.6392280Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:20.6392888Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:20.6393455Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:20.6461439Z 2025-05-07T19:50:20.6461682Z 2025-05-07T19:50:20.6462019Z ================================================================================ 2025-05-07T19:50:20.6462439Z Running code generation script ... 2025-05-07T19:50:20.6463372Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:20.6464198Z ================================================================================ 2025-05-07T19:50:20.6464442Z 2025-05-07T19:50:20.9802022Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:20.9804123Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:20.9804904Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:20.9805442Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:20.9806060Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:20.9806585Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:20.9807054Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:20.9807560Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:20.9808027Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:20.9808513Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:20.9809029Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:20.9809772Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:20.9810309Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:20.9810784Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:20.9811323Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:20.9811836Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:20.9812372Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:20.9812928Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:20.9813425Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:20.9813940Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:20.9814432Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:20.9814964Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:20.9815445Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:20.9815955Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:20.9816457Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:20.9816917Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:20.9817435Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:20.9817938Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:20.9818461Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:20.9818935Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:20.9819427Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:20.9819895Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:20.9820477Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:20.9820974Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:20.9821423Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:20.9821880Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:20.9822316Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:20.9822769Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:20.9823212Z Written: gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:20.9823647Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:20.9824148Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:20.9824608Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:20.9825087Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:20.9825522Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:20.9825977Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:20.9826435Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:20.9826916Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:20.9827407Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:20.9827877Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:20.9828353Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:20.9924505Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:20.9925240Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:20.9926021Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:20.9926591Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:20.9927133Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:20.9927866Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.9928334Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:20.9928806Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:20.9929107Z 2025-05-07T19:50:20.9929112Z 2025-05-07T19:50:20.9929266Z ================================================================================ 2025-05-07T19:50:20.9929635Z Running code generation script ... 2025-05-07T19:50:20.9930434Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:20.9931236Z ================================================================================ 2025-05-07T19:50:20.9931505Z 2025-05-07T19:50:21.2479244Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:21.2481773Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:21.2482870Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:21.2483349Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:21.2483788Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:21.2484266Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:21.2485156Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:21.2485657Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:21.2486195Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:21.2486762Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:21.2487264Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:21.2581111Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:21.2590947Z 2025-05-07T19:50:21.2591446Z 2025-05-07T19:50:21.2591843Z ================================================================================ 2025-05-07T19:50:21.2593013Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:21.2593613Z 2025-05-07T19:50:21.2593850Z CPU_SRCS: 2025-05-07T19:50:21.2594261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:21.2594974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:21.2595656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:21.2596307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:21.2596948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:21.2597476Z 2025-05-07T19:50:21.2597699Z GPU_SRCS: 2025-05-07T19:50:21.2598056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:21.2598706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:21.2599411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:21.2600086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:21.2600745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:21.2601357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:21.2602026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:21.2602649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:21.2603371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:21.2604061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:21.2604545Z 2025-05-07T19:50:21.2604782Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.2604931Z 2025-05-07T19:50:21.2605139Z 2025-05-07T19:50:21.2605374Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.2605521Z 2025-05-07T19:50:21.2605608Z 2025-05-07T19:50:21.2605836Z OTHER_SRCS: 2025-05-07T19:50:21.2605968Z 2025-05-07T19:50:21.2606049Z 2025-05-07T19:50:21.2606264Z CC_FLAGS: 2025-05-07T19:50:21.2606388Z 2025-05-07T19:50:21.2606487Z 2025-05-07T19:50:21.2606687Z NVCC_FLAGS: 2025-05-07T19:50:21.2606942Z --expt-relaxed-constexpr 2025-05-07T19:50:21.2607229Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.2607536Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.2607841Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.2608123Z 2025-05-07T19:50:21.2608319Z HIPCC_FLAGS: 2025-05-07T19:50:21.2608473Z 2025-05-07T19:50:21.2608556Z 2025-05-07T19:50:21.2608746Z INCLUDE_DIRS: 2025-05-07T19:50:21.2609015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.2609335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.2609654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.2610008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.2610510Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.2611347Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.2612005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.2612452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.2612889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.2613387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.2613933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.2614401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.2614994Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.2615583Z 2025-05-07T19:50:21.2615814Z Selected Source Files: 2025-05-07T19:50:21.2616248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:21.2616924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:21.2617615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:21.2618228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:21.2618862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:21.2619498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:21.2620123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:21.2620759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:21.2621435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:21.2622096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:21.2622689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:21.2623346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:21.2623945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:21.2624567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:21.2625253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:21.2625736Z 2025-05-07T19:50:21.2625979Z HIPified Source Files: 2025-05-07T19:50:21.2626143Z 2025-05-07T19:50:21.2626230Z 2025-05-07T19:50:21.2626466Z Library Dependencies: 2025-05-07T19:50:21.2626715Z torch 2025-05-07T19:50:21.2626945Z torch_library 2025-05-07T19:50:21.2627397Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.2628191Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.2628926Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.2630027Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.2630832Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.2631466Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.2631923Z 2025-05-07T19:50:21.2632129Z Output Library: 2025-05-07T19:50:21.2632405Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:21.2632651Z 2025-05-07T19:50:21.2632896Z Destination Directory: 2025-05-07T19:50:21.2633155Z fbgemm_gpu 2025-05-07T19:50:21.2633429Z ================================================================================ 2025-05-07T19:50:21.2633674Z 2025-05-07T19:50:21.3066203Z 2025-05-07T19:50:21.3066364Z 2025-05-07T19:50:21.3066890Z ================================================================================ 2025-05-07T19:50:21.3068288Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:21.3069316Z 2025-05-07T19:50:21.3070103Z CPU_SRCS: 2025-05-07T19:50:21.3070982Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:21.3072381Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:21.3073686Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:21.3074728Z 2025-05-07T19:50:21.3075281Z GPU_SRCS: 2025-05-07T19:50:21.3076091Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:21.3077156Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:21.3077715Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:21.3078357Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:21.3079233Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:21.3079866Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:21.3080508Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:21.3081111Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:21.3081776Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:21.3082453Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:21.3083136Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:21.3083796Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:21.3084867Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:21.3085663Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:21.3086340Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:21.3087005Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:21.3087653Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:21.3088315Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:21.3088967Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:21.3089612Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:21.3090253Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:21.3090861Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:21.3091719Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.3092156Z 2025-05-07T19:50:21.3092399Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.3092549Z 2025-05-07T19:50:21.3092664Z 2025-05-07T19:50:21.3092862Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.3093010Z 2025-05-07T19:50:21.3093121Z 2025-05-07T19:50:21.3093322Z OTHER_SRCS: 2025-05-07T19:50:21.3093450Z 2025-05-07T19:50:21.3093556Z 2025-05-07T19:50:21.3093747Z CC_FLAGS: 2025-05-07T19:50:21.3093894Z 2025-05-07T19:50:21.3093975Z 2025-05-07T19:50:21.3094168Z NVCC_FLAGS: 2025-05-07T19:50:21.3094417Z --expt-relaxed-constexpr 2025-05-07T19:50:21.3094702Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.3095012Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.3095315Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.3095590Z 2025-05-07T19:50:21.3095804Z HIPCC_FLAGS: 2025-05-07T19:50:21.3095927Z 2025-05-07T19:50:21.3096006Z 2025-05-07T19:50:21.3096208Z INCLUDE_DIRS: 2025-05-07T19:50:21.3096466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.3096768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.3097035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.3097321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.3097805Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.3098574Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.3099199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.3099601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.3100015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.3100477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.3100982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.3101453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.3102176Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.3102707Z 2025-05-07T19:50:21.3102963Z Selected Source Files: 2025-05-07T19:50:21.3103301Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:21.3103752Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:21.3104206Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:21.3104673Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:21.3105156Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:21.3105753Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:21.3106391Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:21.3107009Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:21.3107655Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:21.3108263Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:21.3108907Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:21.3109692Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:21.3110599Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:21.3111297Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:21.3111955Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:21.3112656Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:21.3113336Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:21.3114004Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:21.3114759Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:21.3115398Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:21.3116165Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:21.3116789Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:21.3117423Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:21.3118032Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:21.3118624Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:21.3119227Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.3119637Z 2025-05-07T19:50:21.3119852Z HIPified Source Files: 2025-05-07T19:50:21.3120020Z 2025-05-07T19:50:21.3120098Z 2025-05-07T19:50:21.3120306Z Library Dependencies: 2025-05-07T19:50:21.3120529Z torch 2025-05-07T19:50:21.3120747Z torch_library 2025-05-07T19:50:21.3121227Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.3121904Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.3122614Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.3123409Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.3124166Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.3124651Z asmjit 2025-05-07T19:50:21.3124848Z fbgemm 2025-05-07T19:50:21.3125074Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:21.3125318Z fbgemm_gpu_config 2025-05-07T19:50:21.3125698Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.3126165Z 2025-05-07T19:50:21.3126372Z Output Library: 2025-05-07T19:50:21.3126603Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:21.3126865Z 2025-05-07T19:50:21.3127065Z Destination Directory: 2025-05-07T19:50:21.3127323Z fbgemm_gpu 2025-05-07T19:50:21.3127546Z ================================================================================ 2025-05-07T19:50:21.3127790Z 2025-05-07T19:50:21.5296852Z 2025-05-07T19:50:21.5297054Z 2025-05-07T19:50:21.5297545Z ================================================================================ 2025-05-07T19:50:21.5298754Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:21.5299710Z 2025-05-07T19:50:21.5300247Z CPU_SRCS: 2025-05-07T19:50:21.5300851Z src/config/feature_gates.cpp 2025-05-07T19:50:21.5301587Z 2025-05-07T19:50:21.5302086Z GPU_SRCS: 2025-05-07T19:50:21.5302427Z 2025-05-07T19:50:21.5302641Z 2025-05-07T19:50:21.5303168Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5303598Z 2025-05-07T19:50:21.5303812Z 2025-05-07T19:50:21.5304376Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5304809Z 2025-05-07T19:50:21.5305038Z 2025-05-07T19:50:21.5305565Z OTHER_SRCS: 2025-05-07T19:50:21.5305826Z 2025-05-07T19:50:21.5305907Z 2025-05-07T19:50:21.5306103Z CC_FLAGS: 2025-05-07T19:50:21.5306217Z 2025-05-07T19:50:21.5306296Z 2025-05-07T19:50:21.5306498Z NVCC_FLAGS: 2025-05-07T19:50:21.5306621Z 2025-05-07T19:50:21.5306701Z 2025-05-07T19:50:21.5306898Z HIPCC_FLAGS: 2025-05-07T19:50:21.5307026Z 2025-05-07T19:50:21.5307107Z 2025-05-07T19:50:21.5307319Z INCLUDE_DIRS: 2025-05-07T19:50:21.5307558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5307899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5308210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5308540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5309066Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5310031Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5310972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5311407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5311859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5312375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5312913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5313402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5314055Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5314587Z 2025-05-07T19:50:21.5314812Z Selected Source Files: 2025-05-07T19:50:21.5315081Z src/config/feature_gates.cpp 2025-05-07T19:50:21.5315370Z 2025-05-07T19:50:21.5315574Z HIPified Source Files: 2025-05-07T19:50:21.5315757Z 2025-05-07T19:50:21.5315838Z 2025-05-07T19:50:21.5316041Z Library Dependencies: 2025-05-07T19:50:21.5316370Z torch 2025-05-07T19:50:21.5316570Z torch_library 2025-05-07T19:50:21.5317046Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5317761Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5318478Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5319302Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5320065Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5320689Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5321105Z 2025-05-07T19:50:21.5321312Z Output Library: 2025-05-07T19:50:21.5321538Z fbgemm_gpu_config 2025-05-07T19:50:21.5321775Z 2025-05-07T19:50:21.5321992Z Destination Directory: 2025-05-07T19:50:21.5322240Z fbgemm_gpu 2025-05-07T19:50:21.5322597Z ================================================================================ 2025-05-07T19:50:21.5322839Z 2025-05-07T19:50:21.5322843Z 2025-05-07T19:50:21.5322846Z 2025-05-07T19:50:21.5322963Z ================================================================================ 2025-05-07T19:50:21.5323361Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:21.5323836Z 2025-05-07T19:50:21.5324153Z CPU_SRCS: 2025-05-07T19:50:21.5324613Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:21.5325073Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:21.5325665Z 2025-05-07T19:50:21.5325853Z GPU_SRCS: 2025-05-07T19:50:21.5326149Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:21.5326624Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:21.5327047Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:21.5327450Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:21.5327884Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:21.5328252Z 2025-05-07T19:50:21.5328450Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5328593Z 2025-05-07T19:50:21.5328692Z 2025-05-07T19:50:21.5328892Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5329031Z 2025-05-07T19:50:21.5329138Z 2025-05-07T19:50:21.5329323Z OTHER_SRCS: 2025-05-07T19:50:21.5329467Z 2025-05-07T19:50:21.5329553Z 2025-05-07T19:50:21.5329734Z CC_FLAGS: 2025-05-07T19:50:21.5329874Z 2025-05-07T19:50:21.5329953Z 2025-05-07T19:50:21.5330138Z NVCC_FLAGS: 2025-05-07T19:50:21.5330272Z 2025-05-07T19:50:21.5330349Z 2025-05-07T19:50:21.5330559Z HIPCC_FLAGS: 2025-05-07T19:50:21.5330679Z 2025-05-07T19:50:21.5330759Z 2025-05-07T19:50:21.5330972Z INCLUDE_DIRS: 2025-05-07T19:50:21.5331209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5331552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5331836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5332176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5332763Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5333595Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5334286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5334710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5335182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5335677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5336225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5336996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5337595Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5338137Z 2025-05-07T19:50:21.5338336Z Selected Source Files: 2025-05-07T19:50:21.5338701Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:21.5339161Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:21.5339619Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:21.5340025Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:21.5340440Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:21.5340814Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:21.5341230Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:21.5341617Z 2025-05-07T19:50:21.5341823Z HIPified Source Files: 2025-05-07T19:50:21.5341982Z 2025-05-07T19:50:21.5342075Z 2025-05-07T19:50:21.5342276Z Library Dependencies: 2025-05-07T19:50:21.5342528Z torch 2025-05-07T19:50:21.5342729Z torch_library 2025-05-07T19:50:21.5343184Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5343880Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5346244Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5347074Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5347826Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5348446Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5348858Z 2025-05-07T19:50:21.5349074Z Output Library: 2025-05-07T19:50:21.5349306Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5349652Z 2025-05-07T19:50:21.5349860Z Destination Directory: 2025-05-07T19:50:21.5350118Z fbgemm_gpu 2025-05-07T19:50:21.5350357Z ================================================================================ 2025-05-07T19:50:21.5350617Z 2025-05-07T19:50:21.5350621Z 2025-05-07T19:50:21.5350625Z 2025-05-07T19:50:21.5350749Z ================================================================================ 2025-05-07T19:50:21.5351204Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:21.5351596Z 2025-05-07T19:50:21.5351810Z CPU_SRCS: 2025-05-07T19:50:21.5352050Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:21.5352434Z 2025-05-07T19:50:21.5352653Z GPU_SRCS: 2025-05-07T19:50:21.5352886Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:21.5353203Z 2025-05-07T19:50:21.5353414Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5353584Z 2025-05-07T19:50:21.5353666Z 2025-05-07T19:50:21.5353894Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5354038Z 2025-05-07T19:50:21.5354119Z 2025-05-07T19:50:21.5354331Z OTHER_SRCS: 2025-05-07T19:50:21.5354453Z 2025-05-07T19:50:21.5354536Z 2025-05-07T19:50:21.5354746Z CC_FLAGS: 2025-05-07T19:50:21.5354867Z 2025-05-07T19:50:21.5354949Z 2025-05-07T19:50:21.5355165Z NVCC_FLAGS: 2025-05-07T19:50:21.5355285Z 2025-05-07T19:50:21.5355368Z 2025-05-07T19:50:21.5355578Z HIPCC_FLAGS: 2025-05-07T19:50:21.5355709Z 2025-05-07T19:50:21.5355811Z 2025-05-07T19:50:21.5356119Z INCLUDE_DIRS: 2025-05-07T19:50:21.5356492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5356816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5357128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5357442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5358139Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5358958Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5359649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5360111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5360567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5361079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5361623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5362131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5362705Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5363227Z 2025-05-07T19:50:21.5363436Z Selected Source Files: 2025-05-07T19:50:21.5363710Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:21.5364053Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:21.5364357Z 2025-05-07T19:50:21.5364560Z HIPified Source Files: 2025-05-07T19:50:21.5364723Z 2025-05-07T19:50:21.5364816Z 2025-05-07T19:50:21.5365026Z Library Dependencies: 2025-05-07T19:50:21.5365288Z torch 2025-05-07T19:50:21.5365495Z torch_library 2025-05-07T19:50:21.5365961Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5366653Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5367366Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5368278Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5369022Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5369522Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5369881Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5370295Z 2025-05-07T19:50:21.5370478Z Output Library: 2025-05-07T19:50:21.5370726Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:21.5370982Z 2025-05-07T19:50:21.5371184Z Destination Directory: 2025-05-07T19:50:21.5371434Z fbgemm_gpu 2025-05-07T19:50:21.5371656Z ================================================================================ 2025-05-07T19:50:21.5371891Z 2025-05-07T19:50:21.5371895Z 2025-05-07T19:50:21.5371899Z 2025-05-07T19:50:21.5372030Z ================================================================================ 2025-05-07T19:50:21.5372403Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:21.5372750Z 2025-05-07T19:50:21.5372934Z CPU_SRCS: 2025-05-07T19:50:21.5373200Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:21.5373652Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:21.5374065Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:21.5374400Z 2025-05-07T19:50:21.5374708Z GPU_SRCS: 2025-05-07T19:50:21.5374958Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:21.5375305Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:21.5375676Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:21.5375989Z 2025-05-07T19:50:21.5376194Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5376335Z 2025-05-07T19:50:21.5376407Z 2025-05-07T19:50:21.5376595Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5376729Z 2025-05-07T19:50:21.5376816Z 2025-05-07T19:50:21.5376998Z OTHER_SRCS: 2025-05-07T19:50:21.5377110Z 2025-05-07T19:50:21.5377192Z 2025-05-07T19:50:21.5377367Z CC_FLAGS: 2025-05-07T19:50:21.5377476Z 2025-05-07T19:50:21.5377560Z 2025-05-07T19:50:21.5377797Z NVCC_FLAGS: 2025-05-07T19:50:21.5378024Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5378281Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5378565Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5378849Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5379104Z 2025-05-07T19:50:21.5379279Z HIPCC_FLAGS: 2025-05-07T19:50:21.5379394Z 2025-05-07T19:50:21.5379462Z 2025-05-07T19:50:21.5379639Z INCLUDE_DIRS: 2025-05-07T19:50:21.5379856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5380156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5380418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5380714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5381186Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5381962Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5382780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5383184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5383623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5384092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5384853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5385304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5385871Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5386388Z 2025-05-07T19:50:21.5386574Z Selected Source Files: 2025-05-07T19:50:21.5386868Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:21.5387282Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:21.5387683Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:21.5388021Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:21.5388520Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:21.5388847Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:21.5389147Z 2025-05-07T19:50:21.5389331Z HIPified Source Files: 2025-05-07T19:50:21.5389575Z 2025-05-07T19:50:21.5389648Z 2025-05-07T19:50:21.5389840Z Library Dependencies: 2025-05-07T19:50:21.5390058Z torch 2025-05-07T19:50:21.5390244Z torch_library 2025-05-07T19:50:21.5390667Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5391357Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5392068Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5392892Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5393649Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5394111Z fbgemm 2025-05-07T19:50:21.5394317Z fbgemm_gpu_config 2025-05-07T19:50:21.5394666Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5395074Z 2025-05-07T19:50:21.5395252Z Output Library: 2025-05-07T19:50:21.5395473Z fbgemm_gpu_tbe_common 2025-05-07T19:50:21.5395696Z 2025-05-07T19:50:21.5395899Z Destination Directory: 2025-05-07T19:50:21.5396128Z fbgemm_gpu 2025-05-07T19:50:21.5396366Z ================================================================================ 2025-05-07T19:50:21.5396595Z 2025-05-07T19:50:21.5396599Z 2025-05-07T19:50:21.5396603Z 2025-05-07T19:50:21.5396721Z ================================================================================ 2025-05-07T19:50:21.5397104Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:21.5397456Z 2025-05-07T19:50:21.5397632Z CPU_SRCS: 2025-05-07T19:50:21.5397762Z 2025-05-07T19:50:21.5397839Z 2025-05-07T19:50:21.5398014Z GPU_SRCS: 2025-05-07T19:50:21.5398282Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:21.5398795Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:21.5399225Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:21.5399587Z 2025-05-07T19:50:21.5399776Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5399915Z 2025-05-07T19:50:21.5400008Z 2025-05-07T19:50:21.5400197Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5400346Z 2025-05-07T19:50:21.5400423Z 2025-05-07T19:50:21.5400598Z OTHER_SRCS: 2025-05-07T19:50:21.5400729Z 2025-05-07T19:50:21.5400808Z 2025-05-07T19:50:21.5400978Z CC_FLAGS: 2025-05-07T19:50:21.5401109Z 2025-05-07T19:50:21.5401186Z 2025-05-07T19:50:21.5401381Z NVCC_FLAGS: 2025-05-07T19:50:21.5401585Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5401861Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5402135Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5402431Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5402675Z 2025-05-07T19:50:21.5402857Z HIPCC_FLAGS: 2025-05-07T19:50:21.5402980Z 2025-05-07T19:50:21.5403053Z 2025-05-07T19:50:21.5403233Z INCLUDE_DIRS: 2025-05-07T19:50:21.5403454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5403769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5404049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5404362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5404864Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5405653Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5406319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5406739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5407300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5407759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5408347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5408821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5409369Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5409926Z 2025-05-07T19:50:21.5410146Z Selected Source Files: 2025-05-07T19:50:21.5410484Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:21.5410899Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:21.5411355Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:21.5411710Z 2025-05-07T19:50:21.5411949Z HIPified Source Files: 2025-05-07T19:50:21.5412112Z 2025-05-07T19:50:21.5412226Z 2025-05-07T19:50:21.5412443Z Library Dependencies: 2025-05-07T19:50:21.5412716Z torch 2025-05-07T19:50:21.5412926Z torch_library 2025-05-07T19:50:21.5413398Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5414095Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5414836Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5415675Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5416427Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5417068Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5417483Z 2025-05-07T19:50:21.5417717Z Output Library: 2025-05-07T19:50:21.5417966Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:21.5418251Z 2025-05-07T19:50:21.5418468Z Destination Directory: 2025-05-07T19:50:21.5418745Z fbgemm_gpu 2025-05-07T19:50:21.5418989Z ================================================================================ 2025-05-07T19:50:21.5419252Z 2025-05-07T19:50:21.5419376Z 2025-05-07T19:50:21.5419380Z 2025-05-07T19:50:21.5419501Z ================================================================================ 2025-05-07T19:50:21.5420032Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:21.5420428Z 2025-05-07T19:50:21.5420654Z CPU_SRCS: 2025-05-07T19:50:21.5420918Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5421281Z 2025-05-07T19:50:21.5421485Z GPU_SRCS: 2025-05-07T19:50:21.5421780Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:21.5422192Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:21.5422563Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:21.5422978Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:21.5423587Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:21.5424151Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:21.5424560Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:21.5424981Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:21.5425377Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:21.5425813Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:21.5426282Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:21.5426728Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.5427193Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:21.5427628Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:21.5428092Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:21.5428533Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.5429015Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:21.5429581Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:21.5430010Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:21.5430475Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.5430928Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:21.5431491Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5431963Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5432429Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:21.5432841Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:21.5433303Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5433791Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5434216Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:21.5434640Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5435052Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:21.5435473Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:21.5435893Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5436331Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5436752Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:21.5437179Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5437651Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5438096Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:21.5438520Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:21.5438954Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5439450Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5440112Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:21.5440561Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5441012Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:21.5441436Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:21.5441899Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5442449Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5442899Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:21.5443347Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:21.5443844Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:21.5444332Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:21.5444756Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5445169Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5445491Z 2025-05-07T19:50:21.5445694Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5445839Z 2025-05-07T19:50:21.5445919Z 2025-05-07T19:50:21.5446134Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5446275Z 2025-05-07T19:50:21.5446352Z 2025-05-07T19:50:21.5446548Z OTHER_SRCS: 2025-05-07T19:50:21.5446668Z 2025-05-07T19:50:21.5446759Z 2025-05-07T19:50:21.5446941Z CC_FLAGS: 2025-05-07T19:50:21.5447057Z 2025-05-07T19:50:21.5447150Z 2025-05-07T19:50:21.5447335Z NVCC_FLAGS: 2025-05-07T19:50:21.5447579Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5447857Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5448157Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5448444Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5448718Z 2025-05-07T19:50:21.5448898Z HIPCC_FLAGS: 2025-05-07T19:50:21.5449042Z 2025-05-07T19:50:21.5449118Z 2025-05-07T19:50:21.5449301Z INCLUDE_DIRS: 2025-05-07T19:50:21.5449557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5449899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5450179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5450499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5450997Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5451811Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5452537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5452982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5453442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5453922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5454473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5454942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5455540Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5456055Z 2025-05-07T19:50:21.5456262Z Selected Source Files: 2025-05-07T19:50:21.5456559Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5456972Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:21.5457402Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:21.5457827Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:21.5458261Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:21.5458674Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:21.5459087Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:21.5459513Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5459953Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5460418Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5460868Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:21.5461300Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5461676Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5462069Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:21.5462429Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:21.5462802Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:21.5463189Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:21.5463684Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:21.5464117Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:21.5464510Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:21.5464902Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:21.5465266Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:21.5465675Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:21.5466082Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.5466505Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:21.5466908Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.5467313Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:21.5467722Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:21.5468137Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5468564Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:21.5468943Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:21.5469368Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5469913Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5470358Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:21.5470772Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5471174Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:21.5471579Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:21.5471991Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5472402Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:21.5472810Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5473245Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:21.5473722Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:21.5474162Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5474648Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:21.5475103Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:21.5475539Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5475972Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:21.5476406Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:21.5476836Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:21.5477263Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:21.5477713Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:21.5478182Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:21.5478663Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:21.5479024Z 2025-05-07T19:50:21.5479244Z HIPified Source Files: 2025-05-07T19:50:21.5479400Z 2025-05-07T19:50:21.5479482Z 2025-05-07T19:50:21.5479696Z Library Dependencies: 2025-05-07T19:50:21.5479928Z torch 2025-05-07T19:50:21.5480134Z torch_library 2025-05-07T19:50:21.5480584Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5481300Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5482039Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5482849Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5483630Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5484120Z fbgemm_gpu_tbe_common 2025-05-07T19:50:21.5484676Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5485082Z 2025-05-07T19:50:21.5485296Z Output Library: 2025-05-07T19:50:21.5485670Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:21.5485936Z 2025-05-07T19:50:21.5486152Z Destination Directory: 2025-05-07T19:50:21.5486393Z fbgemm_gpu 2025-05-07T19:50:21.5486652Z ================================================================================ 2025-05-07T19:50:21.5486885Z 2025-05-07T19:50:21.5486889Z 2025-05-07T19:50:21.5486893Z 2025-05-07T19:50:21.5487004Z ================================================================================ 2025-05-07T19:50:21.5487465Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:21.5487898Z 2025-05-07T19:50:21.5488087Z CPU_SRCS: 2025-05-07T19:50:21.5488350Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5488728Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5489114Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:21.5489444Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:21.5489799Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:21.5490152Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:21.5490572Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:21.5491015Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:21.5491426Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:21.5491864Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:21.5492308Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:21.5492752Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5493260Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:21.5493859Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:21.5494439Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:21.5494976Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5495522Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5495949Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5496439Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5496904Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5497348Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5497767Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5498224Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5498738Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5499284Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5499776Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5500284Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5500838Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5501356Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5501981Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5502675Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5503351Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5503978Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5504406Z 2025-05-07T19:50:21.5504607Z GPU_SRCS: 2025-05-07T19:50:21.5504894Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5505389Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5505867Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5506279Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5506779Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5507219Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5507731Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5508297Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5508808Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5509322Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5509985Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5510521Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5511141Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5511848Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5512537Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5513164Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5513722Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5514103Z 2025-05-07T19:50:21.5514304Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5514447Z 2025-05-07T19:50:21.5514532Z 2025-05-07T19:50:21.5514728Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5514867Z 2025-05-07T19:50:21.5514946Z 2025-05-07T19:50:21.5515160Z OTHER_SRCS: 2025-05-07T19:50:21.5515287Z 2025-05-07T19:50:21.5515364Z 2025-05-07T19:50:21.5515574Z CC_FLAGS: 2025-05-07T19:50:21.5515696Z 2025-05-07T19:50:21.5515799Z 2025-05-07T19:50:21.5515994Z NVCC_FLAGS: 2025-05-07T19:50:21.5516245Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5516525Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5516843Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5517149Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5517531Z 2025-05-07T19:50:21.5517730Z HIPCC_FLAGS: 2025-05-07T19:50:21.5517883Z 2025-05-07T19:50:21.5517968Z 2025-05-07T19:50:21.5518157Z INCLUDE_DIRS: 2025-05-07T19:50:21.5518421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5518746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5519066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5519444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5519966Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5520811Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5521489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5521930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5522376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5522878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5523432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5523904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5524486Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5525007Z 2025-05-07T19:50:21.5525222Z Selected Source Files: 2025-05-07T19:50:21.5525519Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5525914Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5526294Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:21.5526644Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:21.5527005Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:21.5527356Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:21.5527769Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:21.5528216Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:21.5528635Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:21.5529120Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:21.5529597Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:21.5530032Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5530544Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:21.5531154Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:21.5531745Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:21.5532272Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5532713Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:21.5533155Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5533619Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5534103Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5534552Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5534975Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5535429Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5535925Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5536494Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5536978Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5537509Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5538068Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5538583Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5539205Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5539893Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5540646Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5541274Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:21.5541786Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5542283Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5542913Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5543361Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5543789Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5544246Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5544753Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5545334Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5545854Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5546381Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5546946Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5547465Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5548096Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5548788Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5549550Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5550174Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5550757Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:21.5551169Z 2025-05-07T19:50:21.5551368Z HIPified Source Files: 2025-05-07T19:50:21.5551553Z 2025-05-07T19:50:21.5551637Z 2025-05-07T19:50:21.5551906Z Library Dependencies: 2025-05-07T19:50:21.5552158Z torch 2025-05-07T19:50:21.5552356Z torch_library 2025-05-07T19:50:21.5552823Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5553539Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5554254Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5555076Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5555832Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5556334Z fbgemm 2025-05-07T19:50:21.5556531Z fbgemm_gpu_config 2025-05-07T19:50:21.5573977Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:21.5574251Z fbgemm_gpu_tbe_common 2025-05-07T19:50:21.5574523Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5574813Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:21.5575261Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5575705Z 2025-05-07T19:50:21.5575899Z Output Library: 2025-05-07T19:50:21.5576159Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:21.5576446Z 2025-05-07T19:50:21.5576672Z Destination Directory: 2025-05-07T19:50:21.5576918Z fbgemm_gpu 2025-05-07T19:50:21.5577170Z ================================================================================ 2025-05-07T19:50:21.5577412Z 2025-05-07T19:50:21.5577668Z 2025-05-07T19:50:21.5577673Z 2025-05-07T19:50:21.5577808Z ================================================================================ 2025-05-07T19:50:21.5578236Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:21.5578632Z 2025-05-07T19:50:21.5578822Z CPU_SRCS: 2025-05-07T19:50:21.5579163Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:21.5579606Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:21.5580132Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:21.5580511Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:21.5580911Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:21.5581260Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:21.5581595Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:21.5581951Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:21.5582337Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:21.5582795Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:21.5583186Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:21.5583625Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:21.5584083Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:21.5584708Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:21.5585277Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:21.5585876Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:21.5586488Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:21.5587010Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:21.5587451Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:21.5587855Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:21.5588225Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:21.5588528Z 2025-05-07T19:50:21.5588705Z GPU_SRCS: 2025-05-07T19:50:21.5588980Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:21.5589526Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:21.5590003Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:21.5590460Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:21.5590930Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:21.5591563Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:21.5592103Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:21.5592644Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5593198Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5593796Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5594335Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:21.5594854Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5595394Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5595864Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:21.5596330Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5596797Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5597295Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5597792Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5598340Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5598811Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:21.5599282Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5599771Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5600246Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:21.5600756Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5601280Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5601829Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5602589Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5603162Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5603688Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:21.5604167Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5604685Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5605117Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:21.5605499Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5605891Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5606318Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5606767Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5607224Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5607658Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:21.5608036Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5608463Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5608859Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:21.5609249Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5609661Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5610063Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5610512Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5610976Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5611415Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:21.5611805Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5612250Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5612641Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:21.5613096Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5613521Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5613929Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5614391Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5614855Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5615284Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:21.5615680Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5616117Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5616532Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:21.5616936Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5617396Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5617852Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5618345Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5618837Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5619307Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:21.5619742Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5620198Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5620682Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:21.5621185Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5621728Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5622266Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5622892Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5623478Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5624017Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:21.5624535Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5625069Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5625586Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:21.5626089Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5626615Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5627150Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5627715Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5628315Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5628862Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:21.5629387Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5630224Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5630713Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:21.5631138Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5631569Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5632022Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5632485Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5632992Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5633451Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:21.5634876Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5635341Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5635848Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:21.5636462Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5637091Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5637731Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5638391Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5639093Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5639742Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:21.5640374Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5641033Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5641496Z 2025-05-07T19:50:21.5641692Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5641837Z 2025-05-07T19:50:21.5641916Z 2025-05-07T19:50:21.5642224Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5642552Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:21.5643167Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:21.5643607Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:21.5643945Z 2025-05-07T19:50:21.5644134Z OTHER_SRCS: 2025-05-07T19:50:21.5644251Z 2025-05-07T19:50:21.5644329Z 2025-05-07T19:50:21.5644504Z CC_FLAGS: 2025-05-07T19:50:21.5644612Z 2025-05-07T19:50:21.5644685Z 2025-05-07T19:50:21.5644855Z NVCC_FLAGS: 2025-05-07T19:50:21.5645065Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5645311Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5645639Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5645911Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5646154Z 2025-05-07T19:50:21.5646328Z HIPCC_FLAGS: 2025-05-07T19:50:21.5646451Z 2025-05-07T19:50:21.5646523Z 2025-05-07T19:50:21.5646705Z INCLUDE_DIRS: 2025-05-07T19:50:21.5646913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5647214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5647491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5647773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5648243Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5648978Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5649594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5649980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5650390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5650833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5651330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5651760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5652286Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5652756Z 2025-05-07T19:50:21.5652936Z Selected Source Files: 2025-05-07T19:50:21.5653278Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:21.5653679Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:21.5654013Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:21.5654366Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:21.5654710Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:21.5655019Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:21.5655319Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:21.5655701Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:21.5656063Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:21.5656483Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:21.5656840Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:21.5657234Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:21.5657653Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:21.5658028Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:21.5658509Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:21.5659055Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:21.5659600Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:21.5660072Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:21.5660479Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:21.5660844Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:21.5661181Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:21.5661518Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:21.5661905Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:21.5662339Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:21.5662750Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:21.5663166Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:21.5663606Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:21.5664087Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:21.5664575Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5665075Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5665701Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5666184Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:21.5666658Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5667137Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5667571Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:21.5667971Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5668393Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5668836Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5669292Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5670068Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5670541Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:21.5670993Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5671483Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5671968Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:21.5672473Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5672990Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5673528Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5674080Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5674674Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5675235Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:21.5675744Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5676307Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5676840Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:21.5677251Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5677671Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5678112Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5678581Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5679074Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5679530Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:21.5679940Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5680392Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5680807Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:21.5681215Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5681661Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5682283Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5682750Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5683245Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5683693Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:21.5684102Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5684699Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5685289Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:21.5685701Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5686133Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5686569Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5687040Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5687666Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5688130Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:21.5688552Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5689010Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5689453Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:21.5689884Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5690353Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5690825Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5691339Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5691868Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5692353Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:21.5692822Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5693315Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5693836Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:21.5694379Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5694957Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5695528Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5696141Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5696793Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5697478Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:21.5697995Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5698527Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5699118Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:21.5699613Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5700144Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5700674Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5701226Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5701817Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5702352Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:21.5702870Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5703412Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5703878Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:21.5704265Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5704660Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5705073Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5705505Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5705984Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5706400Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:21.5706808Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5707235Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5707701Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:21.5708259Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5708836Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5709578Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5710401Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5711092Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5711748Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:21.5712364Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5713008Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5713470Z 2025-05-07T19:50:21.5713674Z HIPified Source Files: 2025-05-07T19:50:21.5713829Z 2025-05-07T19:50:21.5713912Z 2025-05-07T19:50:21.5714110Z Library Dependencies: 2025-05-07T19:50:21.5714344Z torch 2025-05-07T19:50:21.5714527Z torch_library 2025-05-07T19:50:21.5714981Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5715664Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5716368Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5717167Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5717927Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5718393Z fbgemm 2025-05-07T19:50:21.5718586Z fbgemm_gpu_config 2025-05-07T19:50:21.5718816Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:21.5719043Z fbgemm_gpu_tbe_common 2025-05-07T19:50:21.5719289Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5719533Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:21.5719940Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5720335Z 2025-05-07T19:50:21.5720525Z Output Library: 2025-05-07T19:50:21.5720815Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:21.5721081Z 2025-05-07T19:50:21.5721265Z Destination Directory: 2025-05-07T19:50:21.5721500Z fbgemm_gpu 2025-05-07T19:50:21.5721739Z ================================================================================ 2025-05-07T19:50:21.5721971Z 2025-05-07T19:50:21.5721975Z 2025-05-07T19:50:21.5721980Z 2025-05-07T19:50:21.5722091Z ================================================================================ 2025-05-07T19:50:21.5722632Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:21.5722989Z 2025-05-07T19:50:21.5723172Z CPU_SRCS: 2025-05-07T19:50:21.5723275Z 2025-05-07T19:50:21.5723346Z 2025-05-07T19:50:21.5723521Z GPU_SRCS: 2025-05-07T19:50:21.5723815Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:21.5724305Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:21.5724842Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:21.5725343Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:21.5725854Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:21.5726381Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:21.5726910Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:21.5727428Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:21.5727960Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:21.5728506Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:21.5729034Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:21.5729599Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:21.5730069Z 2025-05-07T19:50:21.5730262Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5730392Z 2025-05-07T19:50:21.5730461Z 2025-05-07T19:50:21.5730644Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5730771Z 2025-05-07T19:50:21.5730841Z 2025-05-07T19:50:21.5731027Z OTHER_SRCS: 2025-05-07T19:50:21.5731137Z 2025-05-07T19:50:21.5731209Z 2025-05-07T19:50:21.5731382Z CC_FLAGS: 2025-05-07T19:50:21.5731485Z 2025-05-07T19:50:21.5731563Z 2025-05-07T19:50:21.5731719Z NVCC_FLAGS: 2025-05-07T19:50:21.5731926Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5732174Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5732439Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5732715Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5732961Z 2025-05-07T19:50:21.5733132Z HIPCC_FLAGS: 2025-05-07T19:50:21.5733260Z 2025-05-07T19:50:21.5733331Z 2025-05-07T19:50:21.5733502Z INCLUDE_DIRS: 2025-05-07T19:50:21.5733729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5734010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5734285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5734577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5735043Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5735778Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5736378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5736762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5737156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5737594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5738083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5738500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5739031Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5739507Z 2025-05-07T19:50:21.5739687Z Selected Source Files: 2025-05-07T19:50:21.5740065Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:21.5740574Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:21.5741099Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:21.5741613Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:21.5742271Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:21.5742813Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:21.5743324Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:21.5743852Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:21.5744385Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:21.5744923Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:21.5745468Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:21.5745695Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:21.5745766Z 2025-05-07T19:50:21.5745849Z HIPified Source Files: 2025-05-07T19:50:21.5745853Z 2025-05-07T19:50:21.5745935Z 2025-05-07T19:50:21.5746020Z Library Dependencies: 2025-05-07T19:50:21.5746085Z torch 2025-05-07T19:50:21.5746173Z torch_library 2025-05-07T19:50:21.5746462Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5746694Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5747011Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5747340Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5747649Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5747745Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:21.5747953Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5748020Z 2025-05-07T19:50:21.5748096Z Output Library: 2025-05-07T19:50:21.5748214Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:21.5748285Z 2025-05-07T19:50:21.5748372Z Destination Directory: 2025-05-07T19:50:21.5748445Z fbgemm_gpu 2025-05-07T19:50:21.5748562Z ================================================================================ 2025-05-07T19:50:21.5748566Z 2025-05-07T19:50:21.5748569Z 2025-05-07T19:50:21.5748573Z 2025-05-07T19:50:21.5748668Z ================================================================================ 2025-05-07T19:50:21.5748859Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:21.5748936Z 2025-05-07T19:50:21.5749009Z CPU_SRCS: 2025-05-07T19:50:21.5749017Z 2025-05-07T19:50:21.5749083Z 2025-05-07T19:50:21.5749169Z GPU_SRCS: 2025-05-07T19:50:21.5749361Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5749622Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5750004Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5750205Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5750454Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5750706Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5750872Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5751034Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5751193Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5751369Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5751615Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5751786Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5751991Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5752209Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5752436Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5752621Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5752839Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5753051Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5753249Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5753484Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5753713Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5753916Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5754141Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5754360Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5754601Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5754884Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5755152Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5755402Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5755680Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5755968Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5756180Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5756353Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5756538Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5756689Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5756866Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5757062Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5757210Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5757388Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5757569Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5757739Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5757927Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5758122Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5758288Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5758461Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5758640Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5758804Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5758987Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5759170Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5759244Z 2025-05-07T19:50:21.5759335Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5759339Z 2025-05-07T19:50:21.5759412Z 2025-05-07T19:50:21.5759497Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5759501Z 2025-05-07T19:50:21.5759583Z 2025-05-07T19:50:21.5759665Z OTHER_SRCS: 2025-05-07T19:50:21.5759670Z 2025-05-07T19:50:21.5759745Z 2025-05-07T19:50:21.5759832Z CC_FLAGS: 2025-05-07T19:50:21.5759839Z 2025-05-07T19:50:21.5759910Z 2025-05-07T19:50:21.5759988Z NVCC_FLAGS: 2025-05-07T19:50:21.5760139Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5760246Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5760345Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5760442Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5760522Z 2025-05-07T19:50:21.5760607Z HIPCC_FLAGS: 2025-05-07T19:50:21.5760611Z 2025-05-07T19:50:21.5760685Z 2025-05-07T19:50:21.5760765Z INCLUDE_DIRS: 2025-05-07T19:50:21.5760880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5760981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5761082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5761189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5761475Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5761873Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5762027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5762295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5762438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5762624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5762816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5762947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5763233Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5763309Z 2025-05-07T19:50:21.5763389Z Selected Source Files: 2025-05-07T19:50:21.5763574Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5763749Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5763952Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5764186Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5764424Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5764672Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5764813Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5764960Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5765115Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5765268Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5765409Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:21.5765569Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:21.5765749Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5765953Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5766171Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5766358Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5766558Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5766758Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5766954Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5767163Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5767372Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5767563Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5767764Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5767965Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5768198Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5768493Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5768741Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5768976Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5769231Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5769485Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5769627Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5769796Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5769957Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5770094Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5770266Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5770440Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5770579Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5770744Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5770921Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5771070Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5771242Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5771429Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5771567Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:21.5771728Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5771907Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5772052Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:21.5772274Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:21.5772447Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:21.5772528Z 2025-05-07T19:50:21.5772612Z HIPified Source Files: 2025-05-07T19:50:21.5772616Z 2025-05-07T19:50:21.5772681Z 2025-05-07T19:50:21.5772775Z Library Dependencies: 2025-05-07T19:50:21.5772841Z torch 2025-05-07T19:50:21.5772915Z torch_library 2025-05-07T19:50:21.5773216Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5773450Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5773756Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5774090Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5774362Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5774472Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:21.5774673Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5774773Z 2025-05-07T19:50:21.5774854Z Output Library: 2025-05-07T19:50:21.5774962Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:21.5775038Z 2025-05-07T19:50:21.5775144Z Destination Directory: 2025-05-07T19:50:21.5775224Z fbgemm_gpu 2025-05-07T19:50:21.5775330Z ================================================================================ 2025-05-07T19:50:21.5775334Z 2025-05-07T19:50:21.5775338Z 2025-05-07T19:50:21.5775341Z 2025-05-07T19:50:21.5775462Z ================================================================================ 2025-05-07T19:50:21.5775669Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:21.5775742Z 2025-05-07T19:50:21.5775839Z CPU_SRCS: 2025-05-07T19:50:21.5775843Z 2025-05-07T19:50:21.5775920Z 2025-05-07T19:50:21.5776001Z GPU_SRCS: 2025-05-07T19:50:21.5776210Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:21.5776359Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:21.5776518Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5776680Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5776865Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5777040Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:21.5777228Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5777443Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5777590Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:21.5777735Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:21.5777930Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5778106Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5778225Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:21.5778299Z 2025-05-07T19:50:21.5778410Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5778414Z 2025-05-07T19:50:21.5778488Z 2025-05-07T19:50:21.5778573Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5778577Z 2025-05-07T19:50:21.5778675Z 2025-05-07T19:50:21.5778757Z OTHER_SRCS: 2025-05-07T19:50:21.5778761Z 2025-05-07T19:50:21.5778835Z 2025-05-07T19:50:21.5778914Z CC_FLAGS: 2025-05-07T19:50:21.5778936Z 2025-05-07T19:50:21.5779011Z 2025-05-07T19:50:21.5779097Z NVCC_FLAGS: 2025-05-07T19:50:21.5779196Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5779310Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5779421Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5779520Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5779595Z 2025-05-07T19:50:21.5779693Z HIPCC_FLAGS: 2025-05-07T19:50:21.5779696Z 2025-05-07T19:50:21.5779768Z 2025-05-07T19:50:21.5779849Z INCLUDE_DIRS: 2025-05-07T19:50:21.5780023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5780122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5780222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5780337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5780609Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5780981Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5781118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5781294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5781447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5781642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5781847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5781984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5782281Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5782374Z 2025-05-07T19:50:21.5782464Z Selected Source Files: 2025-05-07T19:50:21.5782607Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:21.5782781Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:21.5782949Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:21.5783058Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:21.5783191Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:21.5783365Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:21.5783522Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:21.5783689Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:21.5783874Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:21.5784080Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:21.5784287Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:21.5784613Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:21.5784981Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:21.5785065Z 2025-05-07T19:50:21.5785165Z HIPified Source Files: 2025-05-07T19:50:21.5785169Z 2025-05-07T19:50:21.5785323Z 2025-05-07T19:50:21.5785421Z Library Dependencies: 2025-05-07T19:50:21.5785502Z torch 2025-05-07T19:50:21.5785589Z torch_library 2025-05-07T19:50:21.5785925Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5786182Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5786525Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5786910Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5787197Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5787303Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:21.5787539Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5787620Z 2025-05-07T19:50:21.5787711Z Output Library: 2025-05-07T19:50:21.5787828Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:21.5787921Z 2025-05-07T19:50:21.5788016Z Destination Directory: 2025-05-07T19:50:21.5788106Z fbgemm_gpu 2025-05-07T19:50:21.5788237Z ================================================================================ 2025-05-07T19:50:21.5788241Z 2025-05-07T19:50:21.5788245Z 2025-05-07T19:50:21.5788249Z 2025-05-07T19:50:21.5788360Z ================================================================================ 2025-05-07T19:50:21.5788595Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:21.5788834Z 2025-05-07T19:50:21.5788916Z CPU_SRCS: 2025-05-07T19:50:21.5788920Z 2025-05-07T19:50:21.5788998Z 2025-05-07T19:50:21.5789077Z GPU_SRCS: 2025-05-07T19:50:21.5789209Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:21.5789348Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:21.5789529Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:21.5789661Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:21.5789772Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:21.5789892Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:21.5790078Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:21.5790241Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:21.5790353Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:21.5790540Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:21.5790690Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:21.5790857Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:21.5791081Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:21.5791344Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:21.5791546Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:21.5791716Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:21.5791848Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:21.5792032Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:21.5792198Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:21.5792387Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:21.5792601Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:21.5792746Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:21.5792901Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:21.5793065Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:21.5793292Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:21.5793437Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:21.5793593Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:21.5793770Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:21.5793938Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:21.5794147Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:21.5794386Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:21.5794595Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:21.5794809Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:21.5794970Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:21.5795125Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:21.5795370Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:21.5795643Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:21.5795725Z 2025-05-07T19:50:21.5795821Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5795825Z 2025-05-07T19:50:21.5795902Z 2025-05-07T19:50:21.5796017Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5796021Z 2025-05-07T19:50:21.5796104Z 2025-05-07T19:50:21.5796185Z OTHER_SRCS: 2025-05-07T19:50:21.5796189Z 2025-05-07T19:50:21.5796282Z 2025-05-07T19:50:21.5796366Z CC_FLAGS: 2025-05-07T19:50:21.5796371Z 2025-05-07T19:50:21.5796447Z 2025-05-07T19:50:21.5796532Z NVCC_FLAGS: 2025-05-07T19:50:21.5796648Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5796750Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5796857Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5796971Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5797048Z 2025-05-07T19:50:21.5797132Z HIPCC_FLAGS: 2025-05-07T19:50:21.5797188Z 2025-05-07T19:50:21.5797266Z 2025-05-07T19:50:21.5797374Z INCLUDE_DIRS: 2025-05-07T19:50:21.5797490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5797587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5797718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5797828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5798114Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5798527Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5798679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5798845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5799005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5799230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5799434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5799587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5799930Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5800006Z 2025-05-07T19:50:21.5800101Z Selected Source Files: 2025-05-07T19:50:21.5800220Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:21.5800377Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:21.5800491Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:21.5800606Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:21.5800726Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:21.5800841Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:21.5800999Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:21.5801175Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:21.5801287Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:21.5801476Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:21.5801653Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:21.5801829Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:21.5802151Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:21.5802367Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:21.5802572Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:21.5802726Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:21.5802856Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:21.5803034Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:21.5803190Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:21.5803371Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:21.5803560Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:21.5803704Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:21.5803855Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:21.5803986Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:21.5804143Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:21.5804277Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:21.5804418Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:21.5804585Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:21.5804735Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:21.5804925Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:21.5805123Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:21.5805327Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:21.5805524Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:21.5806106Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:21.5806263Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:21.5806492Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:21.5806722Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:21.5806802Z 2025-05-07T19:50:21.5806885Z HIPified Source Files: 2025-05-07T19:50:21.5806889Z 2025-05-07T19:50:21.5806954Z 2025-05-07T19:50:21.5807034Z Library Dependencies: 2025-05-07T19:50:21.5807113Z torch 2025-05-07T19:50:21.5807185Z torch_library 2025-05-07T19:50:21.5807475Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5807725Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5808035Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5808372Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5808640Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5808718Z fbgemm_gpu_config 2025-05-07T19:50:21.5808797Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5808996Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5809078Z 2025-05-07T19:50:21.5809151Z Output Library: 2025-05-07T19:50:21.5809262Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:21.5809347Z 2025-05-07T19:50:21.5809436Z Destination Directory: 2025-05-07T19:50:21.5809508Z fbgemm_gpu 2025-05-07T19:50:21.5809609Z ================================================================================ 2025-05-07T19:50:21.5809613Z 2025-05-07T19:50:21.5809629Z 2025-05-07T19:50:21.5809632Z 2025-05-07T19:50:21.5809735Z ================================================================================ 2025-05-07T19:50:21.5809959Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:21.5810026Z 2025-05-07T19:50:21.5810110Z CPU_SRCS: 2025-05-07T19:50:21.5810309Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:21.5810486Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:21.5810563Z 2025-05-07T19:50:21.5810636Z GPU_SRCS: 2025-05-07T19:50:21.5810814Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:21.5810945Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:21.5811067Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:21.5811194Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:21.5811331Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:21.5811466Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:21.5811590Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:21.5811716Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:21.5811799Z 2025-05-07T19:50:21.5811884Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5811888Z 2025-05-07T19:50:21.5811954Z 2025-05-07T19:50:21.5812037Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5812041Z 2025-05-07T19:50:21.5812124Z 2025-05-07T19:50:21.5812197Z OTHER_SRCS: 2025-05-07T19:50:21.5812201Z 2025-05-07T19:50:21.5812272Z 2025-05-07T19:50:21.5812366Z CC_FLAGS: 2025-05-07T19:50:21.5812370Z 2025-05-07T19:50:21.5812435Z 2025-05-07T19:50:21.5812510Z NVCC_FLAGS: 2025-05-07T19:50:21.5812603Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5812705Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5812797Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5812886Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5812969Z 2025-05-07T19:50:21.5813045Z HIPCC_FLAGS: 2025-05-07T19:50:21.5813048Z 2025-05-07T19:50:21.5813113Z 2025-05-07T19:50:21.5813186Z INCLUDE_DIRS: 2025-05-07T19:50:21.5813307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5813462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5813563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5813666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5813931Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5814294Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5814431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5814583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5814730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5814918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5815120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5815250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5815535Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5815619Z 2025-05-07T19:50:21.5815709Z Selected Source Files: 2025-05-07T19:50:21.5815906Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:21.5816095Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:21.5816275Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:21.5816409Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:21.5816526Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:21.5816665Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:21.5816802Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:21.5816930Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:21.5817063Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:21.5817182Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:21.5817250Z 2025-05-07T19:50:21.5817335Z HIPified Source Files: 2025-05-07T19:50:21.5817351Z 2025-05-07T19:50:21.5817465Z 2025-05-07T19:50:21.5817552Z Library Dependencies: 2025-05-07T19:50:21.5817617Z torch 2025-05-07T19:50:21.5817700Z torch_library 2025-05-07T19:50:21.5817988Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5818222Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5818542Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5818872Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5819122Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5819216Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:21.5819311Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5819508Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5819579Z 2025-05-07T19:50:21.5819664Z Output Library: 2025-05-07T19:50:21.5819752Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:21.5819819Z 2025-05-07T19:50:21.5819903Z Destination Directory: 2025-05-07T19:50:21.5819986Z fbgemm_gpu 2025-05-07T19:50:21.5820087Z ================================================================================ 2025-05-07T19:50:21.5820091Z 2025-05-07T19:50:21.5820094Z 2025-05-07T19:50:21.5820098Z 2025-05-07T19:50:21.5820193Z ================================================================================ 2025-05-07T19:50:21.5820377Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:21.5820446Z 2025-05-07T19:50:21.5820522Z CPU_SRCS: 2025-05-07T19:50:21.5820693Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:21.5820761Z 2025-05-07T19:50:21.5820833Z GPU_SRCS: 2025-05-07T19:50:21.5820995Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:21.5821148Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:21.5821264Z 2025-05-07T19:50:21.5821345Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5821349Z 2025-05-07T19:50:21.5821424Z 2025-05-07T19:50:21.5821506Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5821510Z 2025-05-07T19:50:21.5821576Z 2025-05-07T19:50:21.5821661Z OTHER_SRCS: 2025-05-07T19:50:21.5821665Z 2025-05-07T19:50:21.5821733Z 2025-05-07T19:50:21.5821805Z CC_FLAGS: 2025-05-07T19:50:21.5821809Z 2025-05-07T19:50:21.5821874Z 2025-05-07T19:50:21.5821958Z NVCC_FLAGS: 2025-05-07T19:50:21.5822049Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5822138Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5822246Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5822334Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5822402Z 2025-05-07T19:50:21.5822475Z HIPCC_FLAGS: 2025-05-07T19:50:21.5822479Z 2025-05-07T19:50:21.5822554Z 2025-05-07T19:50:21.5822629Z INCLUDE_DIRS: 2025-05-07T19:50:21.5822727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5822829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5822930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5823032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5823291Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5823667Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5823803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5823950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5824102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5824286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5824473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5824621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5824904Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5825023Z 2025-05-07T19:50:21.5825115Z Selected Source Files: 2025-05-07T19:50:21.5825295Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:21.5825452Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:21.5825591Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:21.5825670Z 2025-05-07T19:50:21.5825759Z HIPified Source Files: 2025-05-07T19:50:21.5825763Z 2025-05-07T19:50:21.5825829Z 2025-05-07T19:50:21.5825927Z Library Dependencies: 2025-05-07T19:50:21.5825994Z torch 2025-05-07T19:50:21.5826066Z torch_library 2025-05-07T19:50:21.5826351Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5826600Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5826902Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5827233Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5827496Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5827699Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5827766Z 2025-05-07T19:50:21.5827857Z Output Library: 2025-05-07T19:50:21.5827955Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:21.5828021Z 2025-05-07T19:50:21.5828105Z Destination Directory: 2025-05-07T19:50:21.5828185Z fbgemm_gpu 2025-05-07T19:50:21.5828285Z ================================================================================ 2025-05-07T19:50:21.5828288Z 2025-05-07T19:50:21.5828292Z 2025-05-07T19:50:21.5828295Z 2025-05-07T19:50:21.5828398Z ================================================================================ 2025-05-07T19:50:21.5828520Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:21.5828638Z 2025-05-07T19:50:21.5828708Z CPU_SRCS: 2025-05-07T19:50:21.5828813Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:21.5828910Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:21.5829095Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:21.5829291Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:21.5829578Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:21.5829785Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:21.5830174Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:21.5830419Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:21.5830570Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:21.5830699Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:21.5830835Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:21.5830955Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:21.5831109Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:21.5831221Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:21.5831334Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:21.5831455Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:21.5831561Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:21.5831670Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:21.5831762Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:21.5831849Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:21.5831957Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:21.5832067Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:21.5832167Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:21.5832264Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:21.5832511Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:21.5832659Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:21.5832942Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:21.5833187Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:21.5833289Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:21.5833386Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:21.5833488Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:21.5833609Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:21.5833809Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:21.5833900Z src/topology_utils.cpp 2025-05-07T19:50:21.5833984Z 2025-05-07T19:50:21.5834064Z GPU_SRCS: 2025-05-07T19:50:21.5834178Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:21.5834289Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:21.5834509Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:21.5834608Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:21.5834715Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:21.5834926Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:21.5835119Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:21.5835250Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:21.5835385Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:21.5835650Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:21.5835833Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:21.5836010Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:21.5836168Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:21.5836320Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:21.5836452Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:21.5836597Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:21.5836769Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:21.5836885Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:21.5837045Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:21.5837217Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:21.5837348Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:21.5837495Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:21.5837637Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:21.5837737Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:21.5837960Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:21.5838162Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:21.5838347Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:21.5838461Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:21.5838571Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:21.5838709Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:21.5838839Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:21.5838944Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:21.5839054Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:21.5839186Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:21.5839290Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:21.5839419Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:21.5839564Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:21.5839682Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:21.5840006Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:21.5840165Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:21.5840307Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:21.5840413Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:21.5840526Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:21.5840637Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:21.5840801Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:21.5840927Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:21.5841060Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:21.5841165Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:21.5841264Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:21.5841373Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:21.5841489Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:21.5841587Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:21.5841700Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:21.5841823Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:21.5841918Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:21.5841990Z 2025-05-07T19:50:21.5842199Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.5842203Z 2025-05-07T19:50:21.5842267Z 2025-05-07T19:50:21.5842348Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.5842353Z 2025-05-07T19:50:21.5842429Z 2025-05-07T19:50:21.5842515Z OTHER_SRCS: 2025-05-07T19:50:21.5842522Z 2025-05-07T19:50:21.5842590Z 2025-05-07T19:50:21.5842660Z CC_FLAGS: 2025-05-07T19:50:21.5842664Z 2025-05-07T19:50:21.5842743Z 2025-05-07T19:50:21.5842817Z NVCC_FLAGS: 2025-05-07T19:50:21.5842906Z --expt-relaxed-constexpr 2025-05-07T19:50:21.5843009Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.5843102Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.5843189Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.5843253Z 2025-05-07T19:50:21.5843338Z HIPCC_FLAGS: 2025-05-07T19:50:21.5843342Z 2025-05-07T19:50:21.5843408Z 2025-05-07T19:50:21.5843482Z INCLUDE_DIRS: 2025-05-07T19:50:21.5843588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5843680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.5843777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.5843871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.5844150Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:50:21.5844585Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.5844715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.5844873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.5845015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.5845204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.5845396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.5845530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.5845821Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.5845888Z 2025-05-07T19:50:21.5845982Z Selected Source Files: 2025-05-07T19:50:21.5846074Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:21.5846174Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:21.5846382Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:21.5846581Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:21.5846773Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:21.5846983Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:21.5847198Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:21.5847422Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:21.5847617Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:21.5847742Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:21.5847865Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:21.5847990Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:21.5848131Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:21.5848238Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:21.5848389Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:21.5848520Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:21.5848616Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:21.5848715Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:21.5848813Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:21.5848899Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:21.5848999Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:21.5849092Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:21.5849193Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:21.5849284Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:21.5849510Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:21.5849662Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:21.5849860Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:21.5850078Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:21.5850192Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:21.5850284Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:21.5850382Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:21.5850495Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:21.5850694Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:21.5850778Z src/topology_utils.cpp 2025-05-07T19:50:21.5850884Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:21.5850992Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:21.5851190Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:21.5851279Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:21.5851395Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:21.5851572Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:21.5851745Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:21.5851921Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:21.5852063Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:21.5852298Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:21.5852468Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:21.5852645Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:21.5852779Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:21.5852921Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:21.5853057Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:21.5853184Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:21.5853297Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:21.5853405Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:21.5853573Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:21.5853717Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:21.5853838Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:21.5853989Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:21.5854113Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:21.5854202Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:21.5854407Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:21.5854605Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:21.5854777Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:21.5854872Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:21.5854986Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:21.5855112Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:21.5855230Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:21.5855340Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:21.5855438Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:21.5855603Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:21.5855696Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:21.5855821Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:21.5855949Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:21.5856057Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:21.5856194Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:21.5856326Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:21.5856458Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:21.5856551Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:21.5856653Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:21.5856751Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:21.5856850Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:21.5856974Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:21.5857095Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:21.5857205Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:21.5857307Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:21.5857405Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:21.5857520Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:21.5857612Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:21.5857722Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:21.5857821Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:21.5857912Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:21.5857985Z 2025-05-07T19:50:21.5858066Z HIPified Source Files: 2025-05-07T19:50:21.5858070Z 2025-05-07T19:50:21.5858135Z 2025-05-07T19:50:21.5858220Z Library Dependencies: 2025-05-07T19:50:21.5858293Z torch 2025-05-07T19:50:21.5858369Z torch_library 2025-05-07T19:50:21.5858659Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.5858898Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.5859265Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.5859596Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.5859862Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.5859936Z fbgemm 2025-05-07T19:50:21.5860029Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:21.5860123Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:21.5860223Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:21.5860302Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:21.5860390Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:21.5860476Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:21.5860673Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.5860745Z 2025-05-07T19:50:21.5860830Z Output Library: 2025-05-07T19:50:21.5860913Z fbgemm_gpu_py 2025-05-07T19:50:21.5860989Z 2025-05-07T19:50:21.5861083Z Destination Directory: 2025-05-07T19:50:21.5861161Z fbgemm_gpu 2025-05-07T19:50:21.5861272Z ================================================================================ 2025-05-07T19:50:21.5861276Z 2025-05-07T19:50:21.5861371Z -- Configuring done (8.0s) 2025-05-07T19:50:21.7031790Z -- Generating done (0.1s) 2025-05-07T19:50:21.7050428Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 2025-05-07T19:50:21.7207402Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build' 2025-05-07T19:50:21.7207498Z 2025-05-07T19:50:21.7207916Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:21.8628849Z [1/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:21.8900765Z [2/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:21.8984767Z [3/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:21.8994553Z [4/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:21.9004225Z [5/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:21.9057798Z [6/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:21.9194901Z [7/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:21.9238703Z [8/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:21.9274583Z [9/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:21.9412112Z [10/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:21.9680222Z [11/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:21.9742330Z [12/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:21.9782452Z [13/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:21.9807157Z [14/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:21.9920498Z [15/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:21.9956727Z [16/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:22.0097804Z [17/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:22.0117858Z [18/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:22.0129028Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp:10: 2025-05-07T19:50:22.0130824Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.0134147Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0137982Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.0140072Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0141722Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.0145184Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0149184Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.0151323Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0152908Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.0156497Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0160339Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.0162351Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0163913Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.0167340Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0171072Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.0173027Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0174644Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.0178110Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0181946Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.0184021Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0185872Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.0189223Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0193121Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.0195093Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0196613Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.0199960Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0204195Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.0206225Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0207797Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.0211154Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0214971Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.0216934Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0218496Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.0221760Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0225775Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.0227843Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0229528Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.0232917Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0236784Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.0238843Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0239490Z At global scope: 2025-05-07T19:50:22.0240771Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.0251162Z [19/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:22.0320696Z [20/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:22.0448717Z [21/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:22.0574798Z [22/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:22.0605057Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp:10: 2025-05-07T19:50:22.0607212Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.0610581Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0614427Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.0616434Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0618097Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.0621404Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0625108Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.0627084Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0629075Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.0632589Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0636434Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.0638510Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0640108Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.0643353Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0647041Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.0648914Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0650478Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.0653794Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0657920Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.0659923Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0661506Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.0664800Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0668633Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.0670790Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0672419Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.0675665Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0679665Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.0681701Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0683308Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.0686820Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0690665Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.0692725Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0694267Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.0697588Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0701423Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.0703749Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0705362Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.0708731Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0712634Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.0714736Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0715409Z At global scope: 2025-05-07T19:50:22.0716682Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.0727885Z [23/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:22.0900580Z [24/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:22.0950639Z [25/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:22.0964928Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:22.0966261Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp:11: 2025-05-07T19:50:22.0968146Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.0971580Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0975523Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.0977606Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0979652Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.0983109Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0987293Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.0989360Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.0991024Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.0994574Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.0998502Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1000520Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1002114Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.1005710Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1009327Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.1011185Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1012793Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.1016198Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1020084Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1022127Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1023719Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.1027333Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1031316Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1033300Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1034922Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.1038310Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1042245Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1044292Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1045901Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.1049335Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1053148Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.1055388Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1057070Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.1060517Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1064385Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.1066423Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1068042Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.1071559Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1075470Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.1077752Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1078432Z At global scope: 2025-05-07T19:50:22.1079705Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.1090559Z [26/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:22.1110235Z [27/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:22.1121467Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:22.1122757Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp:13: 2025-05-07T19:50:22.1124659Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.1128170Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1132104Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1134208Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1135857Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.1139551Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1143443Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1145510Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1147139Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.1150736Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1154627Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1156584Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1158146Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.1161473Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1165409Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.1167229Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1168867Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.1172179Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1175820Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1177708Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1179289Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.1182781Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1187173Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1189214Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1190942Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.1194398Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1198359Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1200382Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1202011Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.1205435Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1209302Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.1211620Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1213234Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.1216672Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1220656Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.1222816Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1224521Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.1227996Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1232066Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.1234393Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1235051Z At global scope: 2025-05-07T19:50:22.1236340Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.1247291Z [28/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:22.1267673Z [29/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:22.1288407Z [30/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:22.1308623Z [31/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:22.1319697Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:22.1321042Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp:13: 2025-05-07T19:50:22.1322900Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.1326334Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1330304Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1332344Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1334213Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.1337559Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1341461Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1343382Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1344966Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.1348364Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1352354Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1354346Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1356101Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.1359517Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1363229Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.1365149Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1366754Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.1370206Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1374136Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1376192Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1377767Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.1381412Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1385483Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1387543Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1389154Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.1392631Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1396437Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1398364Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1399968Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.1403598Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1407402Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.1409383Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1411031Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.1414410Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1418324Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.1420422Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1422022Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.1425387Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1429802Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.1431895Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1432559Z At global scope: 2025-05-07T19:50:22.1433800Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.1737645Z [32/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:22.1748668Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/a64archtraits_p.h:13, 2025-05-07T19:50:22.1750160Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp:16: 2025-05-07T19:50:22.1752385Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.1755795Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1759695Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1761773Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1763429Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.1766868Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1770888Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1772978Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1774592Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.1778393Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1782443Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1784775Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1786444Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.1790089Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1793771Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.1795572Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1797225Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.1800947Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1804803Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1806853Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1808484Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.1811900Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1815769Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1817797Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1819401Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.1822803Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1826893Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1828937Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1830637Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.1834002Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1837909Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.1839890Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1841496Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.1844890Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1848894Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.1851078Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1852790Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.1856158Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1860106Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.1862149Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1862837Z At global scope: 2025-05-07T19:50:22.1864158Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.1874909Z [33/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:22.1924081Z [34/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:22.1945842Z [35/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:22.1956826Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp:12: 2025-05-07T19:50:22.1958748Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.1962029Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1966018Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.1968077Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1969677Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.1973376Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1977192Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.1979228Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1980809Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.1984176Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1988297Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.1990411Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.1991955Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.1995534Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.1999124Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.2001031Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2002679Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.2005948Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2009794Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.2011761Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2013277Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.2016612Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2020687Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.2022693Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2024299Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.2027643Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2031628Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.2033634Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2035227Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.2038637Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2042647Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.2044743Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2046334Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.2049664Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2053558Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.2055625Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2057163Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.2060558Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2064384Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.2066482Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2067352Z At global scope: 2025-05-07T19:50:22.2068582Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.2106037Z [36/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:22.2116638Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:22.2118064Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:22.2119296Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp:9: 2025-05-07T19:50:22.2121195Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.2125130Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2128993Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.2130891Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2132598Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.2136279Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2140311Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.2142341Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2143961Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.2147419Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2151990Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.2154133Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2155769Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.2159150Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2162949Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.2164924Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2166642Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.2170278Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2174274Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.2176281Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2178012Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.2181679Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2185967Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.2187924Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2189611Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.2192933Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2196797Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.2199166Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2200766Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.2204265Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2208188Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.2210225Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2211839Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.2215367Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2219322Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.2221660Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2223351Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.2226788Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.2230833Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.2232940Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.2233614Z At global scope: 2025-05-07T19:50:22.2234855Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.2245719Z [37/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:22.2265758Z [38/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:22.2998388Z [39/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:22.3009382Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:22.3010763Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:22.3012039Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp:9: 2025-05-07T19:50:22.3013857Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.3017399Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3021365Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.3023464Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3025368Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.3028862Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3032839Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.3034871Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3036558Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.3039936Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3043729Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.3045836Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3047693Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.3051206Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3055004Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.3056837Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3058525Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.3062015Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3065905Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.3067976Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3069769Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.3073444Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3077333Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.3079398Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3081025Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.3084871Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3088750Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.3090800Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3092409Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.3096217Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3100266Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.3102308Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3103998Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.3107547Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3111603Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.3113695Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3115425Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.3118910Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3123148Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.3125219Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3125847Z At global scope: 2025-05-07T19:50:22.3127117Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.3138195Z [40/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:22.3267367Z [41/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:22.3298645Z [42/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:22.3310318Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:22.3311584Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64emithelper_p.h:13, 2025-05-07T19:50:22.3312574Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp:14: 2025-05-07T19:50:22.3314198Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.3317566Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3321501Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.3323588Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3325215Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.3328687Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3333007Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.3335052Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3336755Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.3340208Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3344168Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.3346286Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3347917Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.3351561Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3355333Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.3357371Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3359106Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.3362650Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3366577Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.3368727Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3370261Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.3373742Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3377682Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.3379919Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3381639Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.3385297Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3389208Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.3391330Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3393012Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.3396516Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3400268Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.3402215Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3404215Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.3407727Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3411547Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.3413555Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3415199Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.3418704Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.3422606Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.3424660Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.3425308Z At global scope: 2025-05-07T19:50:22.3426830Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.3465106Z [43/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:22.3796811Z [44/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:22.3876119Z [45/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:22.4493312Z [46/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:22.5558493Z [47/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:22.5862992Z [48/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:22.7305273Z [49/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:22.8061039Z [50/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:22.8073099Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:22.8074589Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:22.8075828Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp:12: 2025-05-07T19:50:22.8077791Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:22.8081581Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8086211Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.8088298Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8089840Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:22.8093322Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8096899Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.8098864Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8100374Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:22.8103913Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8108003Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.8110150Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8111765Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:22.8115232Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8118936Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:22.8120822Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8122476Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:22.8126107Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8130037Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:22.8132324Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8133984Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:22.8137628Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8141560Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:22.8143503Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8145120Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:22.8148465Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8152399Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:22.8154495Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8156467Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:22.8159991Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8163947Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:22.8166045Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8167754Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:22.8171137Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8174532Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:22.8176517Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8178205Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:22.8181888Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:22.8186178Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:22.8188357Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:22.8189025Z At global scope: 2025-05-07T19:50:22.8190441Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:22.8931846Z [51/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:22.9110667Z [52/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:23.0917187Z [53/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:23.1094433Z [54/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:23.3200275Z [55/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:23.3219831Z [56/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:23.3714634Z [57/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:23.4188921Z [58/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:23.4455106Z [59/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:23.9736400Z [60/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:23.9746730Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:23.9748077Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:23.9749149Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp:18: 2025-05-07T19:50:23.9751090Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:23.9754247Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9757928Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:23.9759742Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9761335Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:23.9764800Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9768387Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:23.9770173Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9771675Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:23.9774824Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9778399Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:23.9780183Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9781835Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:23.9785524Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9789040Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:23.9790750Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9792308Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:23.9795640Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9799124Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:23.9801002Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9802446Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:23.9805583Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9809515Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:23.9811434Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9812818Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:23.9815950Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9819428Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:23.9821275Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9822764Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:23.9826037Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9830077Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:23.9831839Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9833382Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:23.9836621Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9840157Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:23.9842036Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9843488Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:23.9846610Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:23.9850323Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:23.9852499Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:23.9853128Z At global scope: 2025-05-07T19:50:23.9854312Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.3834981Z [61/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:24.7099392Z [62/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:25.2819426Z [63/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:25.8813918Z [64/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:25.9752110Z [65/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:28.2707107Z [66/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:30.4895240Z [67/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:30.5424823Z [68/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:30.5742227Z [69/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:30.6526421Z [70/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:30.7754426Z [71/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:32.0413418Z [72/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:32.4011678Z [73/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:32.8476995Z [74/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:32.9345864Z [75/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:34.9120595Z [76/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:35.4438410Z [77/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:36.0878203Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:38.5176471Z [79/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:38.6773442Z [80/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:40.2084905Z [81/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:40.2480351Z [82/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:42.6999091Z [83/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:44.7947635Z [84/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:44.8102902Z [85/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:47.7072376Z [86/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:49.2616697Z [87/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:49.6073582Z [88/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:54.5614806Z [89/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:54.7742630Z [90/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:58.3074431Z [91/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:59.2030507Z [92/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:59.7043397Z [93/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:00.8300539Z [94/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:04.7178417Z [95/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:04.8606014Z [96/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:08.8910106Z [97/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:09.3468556Z [98/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:10.0351083Z [99/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:10.9676291Z [100/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:14.9590010Z [101/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:24.7756070Z [102/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:26.4787476Z [103/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:29.6160452Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:30.2819135Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:31.1553573Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:31.1876828Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:31.3266432Z [108/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:31.6001992Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:31.6667251Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:31.9119961Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:32.0423825Z [112/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:32.4059271Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:33.1610542Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:36.2217048Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:42.7215056Z [116/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:44.9205355Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:45.5135504Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T19:51:48.8725457Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:51:51.8337239Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:51:58.9823777Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:51:59.2132727Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:17.1437292Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:22.8834331Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:26.2713744Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:30.7698660Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:31.4427146Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:31.4523917Z [128/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:32.0266582Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:36.6619084Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:43.6864319Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:48.7632324Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:50.6671997Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:52:52.6086715Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:57.7114229Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:01.4235796Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:01.4263112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4265224Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4265838Z ^ 2025-05-07T19:53:01.4266154Z 2025-05-07T19:53:01.4266638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4267379Z 2025-05-07T19:53:01.4269138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4271510Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4272072Z ^ 2025-05-07T19:53:01.4272407Z 2025-05-07T19:53:01.4274030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4276081Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4276684Z ^ 2025-05-07T19:53:01.4277034Z 2025-05-07T19:53:01.4279019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4281297Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4281898Z ^ 2025-05-07T19:53:01.4282244Z 2025-05-07T19:53:01.4282714Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4283433Z 2025-05-07T19:53:01.4285443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4287395Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4287950Z ^ 2025-05-07T19:53:01.4288227Z 2025-05-07T19:53:01.4289800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4291959Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4292609Z ^ 2025-05-07T19:53:01.4292937Z 2025-05-07T19:53:01.4294630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4296660Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4297191Z ^ 2025-05-07T19:53:01.4297540Z 2025-05-07T19:53:01.4298019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4298744Z 2025-05-07T19:53:01.4300531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4302826Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4303394Z ^ 2025-05-07T19:53:01.4303676Z 2025-05-07T19:53:01.4305385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4307594Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4308185Z ^ 2025-05-07T19:53:01.4308453Z 2025-05-07T19:53:01.4310216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4312332Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4312900Z ^ 2025-05-07T19:53:01.4313198Z 2025-05-07T19:53:01.4313652Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4314261Z 2025-05-07T19:53:01.4315905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4318270Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4318906Z ^ 2025-05-07T19:53:01.4319214Z 2025-05-07T19:53:01.4320758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4322611Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4323161Z ^ 2025-05-07T19:53:01.4323457Z 2025-05-07T19:53:01.4325243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4327457Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4328084Z ^ 2025-05-07T19:53:01.4328404Z 2025-05-07T19:53:01.4328877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.4329623Z 2025-05-07T19:53:01.4331354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4333491Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4334019Z ^ 2025-05-07T19:53:01.4334301Z 2025-05-07T19:53:01.4335852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:01.4337804Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:01.4338408Z ^ 2025-05-07T19:53:01.4338703Z 2025-05-07T19:53:10.2775861Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:10.3813368Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:10.3838443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3840866Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:10.3841672Z ^ 2025-05-07T19:53:10.3842026Z 2025-05-07T19:53:10.3842548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:10.3843239Z 2025-05-07T19:53:10.3844789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3846824Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3847414Z ^ 2025-05-07T19:53:10.3847715Z 2025-05-07T19:53:10.3849341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3851448Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3852035Z ^ 2025-05-07T19:53:10.3852331Z 2025-05-07T19:53:10.3854271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3856373Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3856971Z ^ 2025-05-07T19:53:10.3857263Z 2025-05-07T19:53:10.3858773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3860888Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:10.3861624Z ^ 2025-05-07T19:53:10.3861905Z 2025-05-07T19:53:10.3862363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:10.3862995Z 2025-05-07T19:53:10.3864632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3866543Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3867069Z ^ 2025-05-07T19:53:10.3867362Z 2025-05-07T19:53:10.3868890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3871050Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3871658Z ^ 2025-05-07T19:53:10.3871935Z 2025-05-07T19:53:10.3873587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3875760Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3876366Z ^ 2025-05-07T19:53:10.3876648Z 2025-05-07T19:53:10.3878286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3880501Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:10.3881166Z ^ 2025-05-07T19:53:10.3881416Z 2025-05-07T19:53:10.3881801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:10.3882473Z 2025-05-07T19:53:10.3883922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3886032Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3886577Z ^ 2025-05-07T19:53:10.3886837Z 2025-05-07T19:53:10.3888230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3890088Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3890665Z ^ 2025-05-07T19:53:10.3890956Z 2025-05-07T19:53:10.3892525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3894548Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3895142Z ^ 2025-05-07T19:53:10.3895441Z 2025-05-07T19:53:10.3897364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3899444Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:10.3900200Z ^ 2025-05-07T19:53:10.3900478Z 2025-05-07T19:53:10.3900850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:10.3901451Z 2025-05-07T19:53:10.3902924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3904879Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3905443Z ^ 2025-05-07T19:53:10.3905715Z 2025-05-07T19:53:10.3907341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3909499Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3910105Z ^ 2025-05-07T19:53:10.3910397Z 2025-05-07T19:53:10.3911885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3913753Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3914299Z ^ 2025-05-07T19:53:10.3914547Z 2025-05-07T19:53:10.3916022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3918102Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:10.3919275Z ^ 2025-05-07T19:53:10.3919568Z 2025-05-07T19:53:10.3919989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:10.3920631Z 2025-05-07T19:53:10.3922165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3924130Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3924720Z ^ 2025-05-07T19:53:10.3925014Z 2025-05-07T19:53:10.3926488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3928393Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3929003Z ^ 2025-05-07T19:53:10.3929304Z 2025-05-07T19:53:10.3930680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:10.3932403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:10.3932919Z ^ 2025-05-07T19:53:10.3933166Z 2025-05-07T19:53:12.5502017Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:12.5526707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5529139Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:12.5529942Z ^ 2025-05-07T19:53:12.5530240Z 2025-05-07T19:53:12.5530735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5531433Z 2025-05-07T19:53:12.5533087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5535191Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5535831Z ^ 2025-05-07T19:53:12.5536148Z 2025-05-07T19:53:12.5537799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5539802Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5540372Z ^ 2025-05-07T19:53:12.5540687Z 2025-05-07T19:53:12.5542353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5544339Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5544903Z ^ 2025-05-07T19:53:12.5545165Z 2025-05-07T19:53:12.5547030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5549197Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:12.5550130Z ^ 2025-05-07T19:53:12.5550422Z 2025-05-07T19:53:12.5550902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5551540Z 2025-05-07T19:53:12.5552985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5554937Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5555510Z ^ 2025-05-07T19:53:12.5555832Z 2025-05-07T19:53:12.5557506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5559605Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5560177Z ^ 2025-05-07T19:53:12.5560454Z 2025-05-07T19:53:12.5561974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5564058Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5564630Z ^ 2025-05-07T19:53:12.5564907Z 2025-05-07T19:53:12.5566456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5568723Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:12.5569489Z ^ 2025-05-07T19:53:12.5569741Z 2025-05-07T19:53:12.5570150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5570760Z 2025-05-07T19:53:12.5572255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5574266Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5574834Z ^ 2025-05-07T19:53:12.5575151Z 2025-05-07T19:53:12.5576814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5578913Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5579508Z ^ 2025-05-07T19:53:12.5579808Z 2025-05-07T19:53:12.5581476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5583598Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5584153Z ^ 2025-05-07T19:53:12.5584724Z 2025-05-07T19:53:12.5586410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5588662Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:12.5589559Z ^ 2025-05-07T19:53:12.5589842Z 2025-05-07T19:53:12.5590337Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5591303Z 2025-05-07T19:53:12.5592955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5595049Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5595625Z ^ 2025-05-07T19:53:12.5595942Z 2025-05-07T19:53:12.5597602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5599709Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5600257Z ^ 2025-05-07T19:53:12.5600543Z 2025-05-07T19:53:12.5602240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5604342Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5604919Z ^ 2025-05-07T19:53:12.5605209Z 2025-05-07T19:53:12.5606919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5609152Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:12.5609945Z ^ 2025-05-07T19:53:12.5610232Z 2025-05-07T19:53:12.5610646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5611324Z 2025-05-07T19:53:12.5612925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5615122Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5615675Z ^ 2025-05-07T19:53:12.5615991Z 2025-05-07T19:53:12.5617460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5619407Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5619938Z ^ 2025-05-07T19:53:12.5620220Z 2025-05-07T19:53:12.5621744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5623658Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5624245Z ^ 2025-05-07T19:53:12.5624533Z 2025-05-07T19:53:13.4549698Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:14.1307366Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:14.7240506Z [142/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:17.1819231Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:22.0203841Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:30.9398215Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:34.6395725Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:39.6128192Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:39.7960672Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:45.6233101Z [149/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:01.2987651Z [150/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:05.4997493Z [151/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:06.7598123Z [152/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:08.4801867Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:09.5032386Z [154/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:13.9420911Z [155/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:14.1498422Z [156/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:18.3293965Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:18.3319371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3321806Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3322761Z ^ 2025-05-07T19:54:18.3326353Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:18.3329889Z 2025-05-07T19:54:18.3330366Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:18.3331101Z 2025-05-07T19:54:18.3332500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3334619Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3335592Z ^ 2025-05-07T19:54:18.3339102Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:18.3342352Z 2025-05-07T19:54:18.3345060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3346974Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3347910Z ^ 2025-05-07T19:54:18.3351510Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:18.3354808Z 2025-05-07T19:54:18.3356128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3357962Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3358781Z ^ 2025-05-07T19:54:18.3362533Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:18.3365826Z 2025-05-07T19:54:18.3366942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3369182Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3369909Z ^ 2025-05-07T19:54:18.3373609Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:18.3376922Z 2025-05-07T19:54:18.3378174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3380051Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3380931Z ^ 2025-05-07T19:54:18.3384272Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:18.3387444Z 2025-05-07T19:54:18.3388929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3390962Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3391829Z ^ 2025-05-07T19:54:18.3394976Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:18.3398036Z 2025-05-07T19:54:18.3399344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3401224Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3402058Z ^ 2025-05-07T19:54:18.3405505Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:18.3408516Z 2025-05-07T19:54:18.3409686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3412022Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3412857Z ^ 2025-05-07T19:54:18.3416427Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:18.3419734Z 2025-05-07T19:54:18.3421080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3423028Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3423903Z ^ 2025-05-07T19:54:18.3427488Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:18.3431152Z 2025-05-07T19:54:18.3432488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3434809Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3435779Z ^ 2025-05-07T19:54:18.3439380Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:18.3442730Z 2025-05-07T19:54:18.3444065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3446098Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3447012Z ^ 2025-05-07T19:54:18.3450577Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:18.3453891Z 2025-05-07T19:54:18.3455193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3457229Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3458396Z ^ 2025-05-07T19:54:18.3461903Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:18.3465222Z 2025-05-07T19:54:18.3466524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3468533Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3469588Z ^ 2025-05-07T19:54:18.3473106Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:18.3476425Z 2025-05-07T19:54:18.3477726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3479744Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3480860Z ^ 2025-05-07T19:54:18.3484673Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:18.3487997Z 2025-05-07T19:54:18.3489425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3491427Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3492342Z ^ 2025-05-07T19:54:18.3495870Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:18.3499185Z 2025-05-07T19:54:18.3500453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3502437Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3503691Z ^ 2025-05-07T19:54:18.3507188Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:18.3510586Z 2025-05-07T19:54:18.3511890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3513947Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3514829Z ^ 2025-05-07T19:54:18.3518463Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:18.3521657Z 2025-05-07T19:54:18.3522983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3524798Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3525576Z ^ 2025-05-07T19:54:18.3528980Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:18.3532062Z 2025-05-07T19:54:18.3533325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3535239Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3536155Z ^ 2025-05-07T19:54:18.3539829Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:18.3543204Z 2025-05-07T19:54:18.3544573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3546626Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3547572Z ^ 2025-05-07T19:54:18.3551667Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:18.3555043Z 2025-05-07T19:54:18.3556366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3558401Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3559351Z ^ 2025-05-07T19:54:18.3562992Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:18.3566379Z 2025-05-07T19:54:18.3567681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3569701Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3570657Z ^ 2025-05-07T19:54:18.3574443Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:18.3577800Z 2025-05-07T19:54:18.3579109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3581104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3582007Z ^ 2025-05-07T19:54:18.3585784Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:18.3589077Z 2025-05-07T19:54:18.3590431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3592486Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3593381Z ^ 2025-05-07T19:54:18.3596934Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:18.3600517Z 2025-05-07T19:54:18.3600960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:18.3601627Z 2025-05-07T19:54:18.3602945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3604839Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3605796Z ^ 2025-05-07T19:54:18.3609525Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:18.3613158Z 2025-05-07T19:54:18.3614231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3616361Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3617326Z ^ 2025-05-07T19:54:18.3621467Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:18.3625003Z 2025-05-07T19:54:18.3626336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3628471Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3629577Z ^ 2025-05-07T19:54:18.3633347Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:18.3636930Z 2025-05-07T19:54:18.3638301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3640418Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3641356Z ^ 2025-05-07T19:54:18.3645254Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:18.3648692Z 2025-05-07T19:54:18.3650068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3652210Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3653147Z ^ 2025-05-07T19:54:18.3656885Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:18.3660237Z 2025-05-07T19:54:18.3661447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3663501Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3664362Z ^ 2025-05-07T19:54:18.3668087Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:18.3671592Z 2025-05-07T19:54:18.3672905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3674903Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3675847Z ^ 2025-05-07T19:54:18.3679445Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:18.3682742Z 2025-05-07T19:54:18.3684051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3686248Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3687179Z ^ 2025-05-07T19:54:18.3690584Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:18.3694143Z 2025-05-07T19:54:18.3695520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3697587Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3698540Z ^ 2025-05-07T19:54:18.3702312Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:18.3705792Z 2025-05-07T19:54:18.3707141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3709260Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3710327Z ^ 2025-05-07T19:54:18.3714235Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:18.3717730Z 2025-05-07T19:54:18.3719094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3721036Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3721885Z ^ 2025-05-07T19:54:18.3725222Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:18.3728313Z 2025-05-07T19:54:18.3729524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3731461Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3732308Z ^ 2025-05-07T19:54:18.3735759Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:18.3739136Z 2025-05-07T19:54:18.3740367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3742334Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3743222Z ^ 2025-05-07T19:54:18.3746489Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:18.3750121Z 2025-05-07T19:54:18.3751520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3753561Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3754500Z ^ 2025-05-07T19:54:18.3757724Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:18.3760644Z 2025-05-07T19:54:18.3761800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3763475Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3764325Z ^ 2025-05-07T19:54:18.3767664Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:18.3771015Z 2025-05-07T19:54:18.3772307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3774161Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3775024Z ^ 2025-05-07T19:54:18.3778199Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:18.3781643Z 2025-05-07T19:54:18.3782883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3785001Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3785926Z ^ 2025-05-07T19:54:18.3789551Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:18.3792844Z 2025-05-07T19:54:18.3794213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3795957Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3796693Z ^ 2025-05-07T19:54:18.3800477Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:18.3803720Z 2025-05-07T19:54:18.3805009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3806913Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3807802Z ^ 2025-05-07T19:54:18.3811265Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:18.3814591Z 2025-05-07T19:54:18.3815881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3817889Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3818823Z ^ 2025-05-07T19:54:18.3822398Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:18.3826249Z 2025-05-07T19:54:18.3827546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3829748Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3830469Z ^ 2025-05-07T19:54:18.3833464Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:18.3836741Z 2025-05-07T19:54:18.3838098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3840073Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3840963Z ^ 2025-05-07T19:54:18.3844732Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:18.3847982Z 2025-05-07T19:54:18.3849271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3851295Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3852197Z ^ 2025-05-07T19:54:18.3855845Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:18.3859277Z 2025-05-07T19:54:18.3860568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3862518Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3863400Z ^ 2025-05-07T19:54:18.3866892Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:18.3870517Z 2025-05-07T19:54:18.3870966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:18.3871641Z 2025-05-07T19:54:18.3872958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3875022Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3875967Z ^ 2025-05-07T19:54:18.3879602Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:18.3883005Z 2025-05-07T19:54:18.3884093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3886127Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3886919Z ^ 2025-05-07T19:54:18.3890473Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:18.3893842Z 2025-05-07T19:54:18.3895185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3897210Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3898034Z ^ 2025-05-07T19:54:18.3901784Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:18.3905311Z 2025-05-07T19:54:18.3906656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3908162Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3908859Z ^ 2025-05-07T19:54:18.3912313Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:18.3918392Z 2025-05-07T19:54:18.3919600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3921493Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3922329Z ^ 2025-05-07T19:54:18.3925682Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:18.3928886Z 2025-05-07T19:54:18.3930161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3931964Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3932829Z ^ 2025-05-07T19:54:18.3936402Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:18.3939506Z 2025-05-07T19:54:18.3940852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3942903Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3943770Z ^ 2025-05-07T19:54:18.3947342Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:18.3950918Z 2025-05-07T19:54:18.3952195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3954256Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3955194Z ^ 2025-05-07T19:54:18.3958880Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:18.3962558Z 2025-05-07T19:54:18.3963852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3965908Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3966844Z ^ 2025-05-07T19:54:18.3970472Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:18.3973827Z 2025-05-07T19:54:18.3975122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3977148Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3978082Z ^ 2025-05-07T19:54:18.3981840Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:18.3985455Z 2025-05-07T19:54:18.3986774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3988744Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.3989743Z ^ 2025-05-07T19:54:18.3993325Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:18.3996627Z 2025-05-07T19:54:18.3997851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.3999800Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4000664Z ^ 2025-05-07T19:54:18.4004284Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:18.4008123Z 2025-05-07T19:54:18.4009382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4011317Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4012207Z ^ 2025-05-07T19:54:18.4015553Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:18.4018422Z 2025-05-07T19:54:18.4019508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4021328Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4022296Z ^ 2025-05-07T19:54:18.4025731Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:18.4029578Z 2025-05-07T19:54:18.4030878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4032857Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4033811Z ^ 2025-05-07T19:54:18.4037496Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:18.4040933Z 2025-05-07T19:54:18.4042280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4044310Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4045267Z ^ 2025-05-07T19:54:18.4048939Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:18.4052570Z 2025-05-07T19:54:18.4053907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4055913Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4056844Z ^ 2025-05-07T19:54:18.4060518Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:18.4063870Z 2025-05-07T19:54:18.4065180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4067211Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4068130Z ^ 2025-05-07T19:54:18.4071863Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:18.4075225Z 2025-05-07T19:54:18.4076721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4078719Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4079568Z ^ 2025-05-07T19:54:18.4083214Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:18.4086826Z 2025-05-07T19:54:18.4088147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4090094Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4090961Z ^ 2025-05-07T19:54:18.4094566Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:18.4097313Z 2025-05-07T19:54:18.4098229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4099872Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4100707Z ^ 2025-05-07T19:54:18.4104038Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:18.4107272Z 2025-05-07T19:54:18.4108576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4110802Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4111723Z ^ 2025-05-07T19:54:18.4115502Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:18.4118837Z 2025-05-07T19:54:18.4120432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4122198Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4123094Z ^ 2025-05-07T19:54:18.4126536Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:18.4129834Z 2025-05-07T19:54:18.4131221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4133256Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4134200Z ^ 2025-05-07T19:54:18.4137724Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:18.4141121Z 2025-05-07T19:54:18.4141929Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:18.4142626Z 2025-05-07T19:54:18.4143957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4146004Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4146916Z ^ 2025-05-07T19:54:18.4150726Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:18.4154132Z 2025-05-07T19:54:18.4155435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4157462Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4158359Z ^ 2025-05-07T19:54:18.4161937Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:18.4165283Z 2025-05-07T19:54:18.4166785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4168782Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4169667Z ^ 2025-05-07T19:54:18.4173216Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:18.4176544Z 2025-05-07T19:54:18.4177821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4179800Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4180685Z ^ 2025-05-07T19:54:18.4184112Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:18.4187939Z 2025-05-07T19:54:18.4189258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4191316Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4192242Z ^ 2025-05-07T19:54:18.4195707Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:18.4199039Z 2025-05-07T19:54:18.4200156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4201846Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4202715Z ^ 2025-05-07T19:54:18.4205507Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:18.4208215Z 2025-05-07T19:54:18.4209716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4211619Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4212521Z ^ 2025-05-07T19:54:18.4215961Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:18.4219073Z 2025-05-07T19:54:18.4220208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4222343Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4223252Z ^ 2025-05-07T19:54:18.4226721Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:18.4230053Z 2025-05-07T19:54:18.4231294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4233717Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4234592Z ^ 2025-05-07T19:54:18.4238042Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:18.4241373Z 2025-05-07T19:54:18.4242717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4244806Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4245781Z ^ 2025-05-07T19:54:18.4249460Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:18.4252701Z 2025-05-07T19:54:18.4254251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4256206Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4257083Z ^ 2025-05-07T19:54:18.4260531Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:18.4263951Z 2025-05-07T19:54:18.4265320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4267399Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4268279Z ^ 2025-05-07T19:54:18.4271991Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:18.4275356Z 2025-05-07T19:54:18.4276711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4279079Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4280023Z ^ 2025-05-07T19:54:18.4283670Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:18.4287153Z 2025-05-07T19:54:18.4288482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4290389Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4291234Z ^ 2025-05-07T19:54:18.4294796Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:18.4298174Z 2025-05-07T19:54:18.4299657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4302314Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4303485Z ^ 2025-05-07T19:54:18.4307107Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:18.4310863Z 2025-05-07T19:54:18.4312198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4314312Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4315205Z ^ 2025-05-07T19:54:18.4318888Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:18.4322385Z 2025-05-07T19:54:18.4323688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4326008Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4326891Z ^ 2025-05-07T19:54:18.4330575Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:18.4333978Z 2025-05-07T19:54:18.4335296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4337425Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4338337Z ^ 2025-05-07T19:54:18.4342046Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:18.4345562Z 2025-05-07T19:54:18.4346838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4349029Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4350156Z ^ 2025-05-07T19:54:18.4353558Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:18.4356635Z 2025-05-07T19:54:18.4357897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4359912Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4360821Z ^ 2025-05-07T19:54:18.4364304Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:18.4367606Z 2025-05-07T19:54:18.4368961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4370999Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4371933Z ^ 2025-05-07T19:54:18.4375241Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:18.4378489Z 2025-05-07T19:54:18.4379729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4381949Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4382835Z ^ 2025-05-07T19:54:18.4386473Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:18.4389965Z 2025-05-07T19:54:18.4391164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4393005Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4394285Z ^ 2025-05-07T19:54:18.4397750Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:18.4400929Z 2025-05-07T19:54:18.4402170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4404196Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4405165Z ^ 2025-05-07T19:54:18.4408795Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:18.4412316Z 2025-05-07T19:54:18.4412786Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:18.4413502Z 2025-05-07T19:54:18.4414851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4417275Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4418258Z ^ 2025-05-07T19:54:18.4421953Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:18.4425110Z 2025-05-07T19:54:18.4426318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4427960Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4428789Z ^ 2025-05-07T19:54:18.4432754Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:18.4436228Z 2025-05-07T19:54:18.4437535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4439654Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4440721Z ^ 2025-05-07T19:54:18.4444170Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:18.4447692Z 2025-05-07T19:54:18.4449019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4451099Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4452048Z ^ 2025-05-07T19:54:18.4455623Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:18.4458988Z 2025-05-07T19:54:18.4460307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4462361Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4463459Z ^ 2025-05-07T19:54:18.4466953Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:18.4470565Z 2025-05-07T19:54:18.4472208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4474349Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4475343Z ^ 2025-05-07T19:54:18.4479021Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:18.4482553Z 2025-05-07T19:54:18.4483901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4486365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4490238Z ^ 2025-05-07T19:54:18.4494058Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:18.4497493Z 2025-05-07T19:54:18.4498804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4500865Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4501830Z ^ 2025-05-07T19:54:18.4505289Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:18.4508455Z 2025-05-07T19:54:18.4509911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4511954Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4513303Z ^ 2025-05-07T19:54:18.4516925Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:18.4520357Z 2025-05-07T19:54:18.4521730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4523798Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4524788Z ^ 2025-05-07T19:54:18.4528520Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:18.4531784Z 2025-05-07T19:54:18.4533169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4535077Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4536153Z ^ 2025-05-07T19:54:18.4539189Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:18.4542273Z 2025-05-07T19:54:18.4543568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4545632Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4546543Z ^ 2025-05-07T19:54:18.4550185Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:18.4553458Z 2025-05-07T19:54:18.4554836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4575027Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4576091Z ^ 2025-05-07T19:54:18.4579623Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:18.4583103Z 2025-05-07T19:54:18.4584741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4586825Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4587765Z ^ 2025-05-07T19:54:18.4591275Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:18.4594434Z 2025-05-07T19:54:18.4595607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4597421Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4598300Z ^ 2025-05-07T19:54:18.4602269Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:18.4605588Z 2025-05-07T19:54:18.4606955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4608932Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4609833Z ^ 2025-05-07T19:54:18.4613468Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:18.4616897Z 2025-05-07T19:54:18.4618177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4620216Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4621166Z ^ 2025-05-07T19:54:18.4625200Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:18.4628685Z 2025-05-07T19:54:18.4630181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4632245Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4633206Z ^ 2025-05-07T19:54:18.4636571Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:18.4639684Z 2025-05-07T19:54:18.4640909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4642848Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4643711Z ^ 2025-05-07T19:54:18.4647375Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:18.4650581Z 2025-05-07T19:54:18.4651813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4653739Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4654594Z ^ 2025-05-07T19:54:18.4658135Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:18.4661492Z 2025-05-07T19:54:18.4662805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4664846Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4665714Z ^ 2025-05-07T19:54:18.4669313Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:18.4673171Z 2025-05-07T19:54:18.4674643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4676888Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4677851Z ^ 2025-05-07T19:54:18.4681598Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:18.4685532Z 2025-05-07T19:54:18.4686927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:18.4689011Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:18.4689856Z ^ 2025-05-07T19:54:18.4693607Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:18.4696870Z 2025-05-07T19:54:22.7446882Z [158/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:23.0744564Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:27.1612166Z [160/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:27.3325574Z [161/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:27.3479346Z [162/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:27.8462893Z [163/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:29.3759030Z [164/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:29.3892973Z [165/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4024401Z [166/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4155408Z [167/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4288268Z [168/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4416845Z [169/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4546448Z [170/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4679167Z [171/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.4808907Z [172/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.5106365Z [173/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:29.5148707Z [174/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.5237594Z [175/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.5275951Z [176/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.5370649Z [177/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.5409489Z [178/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.2797936Z [179/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:31.6333432Z [180/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:34.3065034Z [181/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:35.7519823Z [182/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:35.9769119Z [183/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:36.0370281Z [184/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:38.4273719Z [185/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:39.5218426Z [186/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:40.4264738Z [187/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:40.4290299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4292255Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4296419Z ^ 2025-05-07T19:54:40.4296884Z 2025-05-07T19:54:40.4297442Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.4298094Z 2025-05-07T19:54:40.4299599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4301612Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4302152Z ^ 2025-05-07T19:54:40.4302444Z 2025-05-07T19:54:40.4304085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4306165Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4306782Z ^ 2025-05-07T19:54:40.4307090Z 2025-05-07T19:54:40.4308790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4311045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4311643Z ^ 2025-05-07T19:54:40.4311950Z 2025-05-07T19:54:40.4312412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.4313133Z 2025-05-07T19:54:40.4314737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4316730Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4317599Z ^ 2025-05-07T19:54:40.4317917Z 2025-05-07T19:54:40.4319477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4321177Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4321672Z ^ 2025-05-07T19:54:40.4321948Z 2025-05-07T19:54:40.4323433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4325389Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4325965Z ^ 2025-05-07T19:54:40.4326244Z 2025-05-07T19:54:40.4326704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.4327394Z 2025-05-07T19:54:40.4329003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4331044Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4331593Z ^ 2025-05-07T19:54:40.4331907Z 2025-05-07T19:54:40.4333485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4335474Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4336019Z ^ 2025-05-07T19:54:40.4336344Z 2025-05-07T19:54:40.4338218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4340319Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4340895Z ^ 2025-05-07T19:54:40.4341199Z 2025-05-07T19:54:40.4341678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.4342375Z 2025-05-07T19:54:40.4344059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4346208Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4346825Z ^ 2025-05-07T19:54:40.4347139Z 2025-05-07T19:54:40.4348842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4351023Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4351573Z ^ 2025-05-07T19:54:40.4351855Z 2025-05-07T19:54:40.4353335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4355186Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4355658Z ^ 2025-05-07T19:54:40.4355939Z 2025-05-07T19:54:40.4356321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.4356899Z 2025-05-07T19:54:40.4358402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4360657Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4361215Z ^ 2025-05-07T19:54:40.4361505Z 2025-05-07T19:54:40.4363101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:40.4365128Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:40.4365697Z ^ 2025-05-07T19:54:40.4365986Z 2025-05-07T19:54:40.4392403Z [188/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.4521023Z [189/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.4651461Z [190/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.4780189Z [191/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.4907490Z [192/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:40.5034515Z [193/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.0464531Z [194/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:42.5023354Z [195/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:43.0295480Z [196/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:43.8475891Z [197/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:44.1427285Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:48.0434269Z [199/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:50.8090957Z [200/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:53.6123469Z [201/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:54.2904013Z [202/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:54.3497775Z [203/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.5634721Z [204/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.9662125Z [205/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:56.0179784Z [206/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:56.2688235Z [207/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:57.2678571Z [208/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:54:58.9022805Z [209/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:54:58.9049082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9051317Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9051941Z ^ 2025-05-07T19:54:58.9052265Z 2025-05-07T19:54:58.9052739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.9053474Z 2025-05-07T19:54:58.9055246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9057455Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9058048Z ^ 2025-05-07T19:54:58.9058388Z 2025-05-07T19:54:58.9060120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9062249Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9062822Z ^ 2025-05-07T19:54:58.9063131Z 2025-05-07T19:54:58.9064877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9067061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9067679Z ^ 2025-05-07T19:54:58.9068321Z 2025-05-07T19:54:58.9068827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.9069686Z 2025-05-07T19:54:58.9071416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9073587Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9074135Z ^ 2025-05-07T19:54:58.9074467Z 2025-05-07T19:54:58.9076192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9078382Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9078990Z ^ 2025-05-07T19:54:58.9079332Z 2025-05-07T19:54:58.9081089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9083287Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9083875Z ^ 2025-05-07T19:54:58.9084189Z 2025-05-07T19:54:58.9084962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.9085656Z 2025-05-07T19:54:58.9087371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9089549Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9090159Z ^ 2025-05-07T19:54:58.9090807Z 2025-05-07T19:54:58.9092561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9094756Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9095342Z ^ 2025-05-07T19:54:58.9095676Z 2025-05-07T19:54:58.9097404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9099616Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9100213Z ^ 2025-05-07T19:54:58.9100553Z 2025-05-07T19:54:58.9101028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.9101747Z 2025-05-07T19:54:58.9103513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9105706Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9106321Z ^ 2025-05-07T19:54:58.9106634Z 2025-05-07T19:54:58.9108383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9110618Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9111127Z ^ 2025-05-07T19:54:58.9111418Z 2025-05-07T19:54:58.9113368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9115564Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9116139Z ^ 2025-05-07T19:54:58.9116462Z 2025-05-07T19:54:58.9116905Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.9117536Z 2025-05-07T19:54:58.9119210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9121326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9121934Z ^ 2025-05-07T19:54:58.9122239Z 2025-05-07T19:54:58.9124075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.9126132Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:58.9126719Z ^ 2025-05-07T19:54:58.9127039Z 2025-05-07T19:54:59.1198226Z [210/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:00.3588041Z [211/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:01.0702388Z [212/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:01.0726762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0728815Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:01.0729592Z ^ 2025-05-07T19:55:01.0729872Z 2025-05-07T19:55:01.0730308Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.0730987Z 2025-05-07T19:55:01.0732490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0734326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0734850Z ^ 2025-05-07T19:55:01.0735162Z 2025-05-07T19:55:01.0736746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0738721Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0739411Z ^ 2025-05-07T19:55:01.0739673Z 2025-05-07T19:55:01.0741172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0743216Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0743839Z ^ 2025-05-07T19:55:01.0744109Z 2025-05-07T19:55:01.0745548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0748014Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:01.0748765Z ^ 2025-05-07T19:55:01.0749021Z 2025-05-07T19:55:01.0749430Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.0750219Z 2025-05-07T19:55:01.0751744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0753713Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0754237Z ^ 2025-05-07T19:55:01.0754508Z 2025-05-07T19:55:01.0755955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0757915Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0758558Z ^ 2025-05-07T19:55:01.0758878Z 2025-05-07T19:55:01.0760518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0762643Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0763317Z ^ 2025-05-07T19:55:01.0763596Z 2025-05-07T19:55:01.0765434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0768147Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:01.0768996Z ^ 2025-05-07T19:55:01.0769352Z 2025-05-07T19:55:01.0770194Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.0770914Z 2025-05-07T19:55:01.0772794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0775009Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0775618Z ^ 2025-05-07T19:55:01.0775953Z 2025-05-07T19:55:01.0777801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0779948Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0780592Z ^ 2025-05-07T19:55:01.0780913Z 2025-05-07T19:55:01.0782449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0784268Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0785077Z ^ 2025-05-07T19:55:01.0785334Z 2025-05-07T19:55:01.0786654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0788768Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:01.0789478Z ^ 2025-05-07T19:55:01.0789824Z 2025-05-07T19:55:01.0790214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.0790816Z 2025-05-07T19:55:01.0792642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0794497Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0795032Z ^ 2025-05-07T19:55:01.0795315Z 2025-05-07T19:55:01.0796803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0798671Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0799254Z ^ 2025-05-07T19:55:01.0799567Z 2025-05-07T19:55:01.0801117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0802885Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0803438Z ^ 2025-05-07T19:55:01.0803720Z 2025-05-07T19:55:01.0805425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0807398Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:01.0808138Z ^ 2025-05-07T19:55:01.0808420Z 2025-05-07T19:55:01.0808859Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.0809540Z 2025-05-07T19:55:01.0811209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0813608Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0814288Z ^ 2025-05-07T19:55:01.0814624Z 2025-05-07T19:55:01.0816270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0818205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0818896Z ^ 2025-05-07T19:55:01.0819150Z 2025-05-07T19:55:01.0820597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.0822619Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.0823215Z ^ 2025-05-07T19:55:01.0823504Z 2025-05-07T19:55:02.1555911Z [213/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.2688668Z [214/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:55:02.3336782Z [215/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:02.7998518Z [216/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:03.1476137Z [217/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:03.1499490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1501761Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.1502517Z ^ 2025-05-07T19:55:03.1502829Z 2025-05-07T19:55:03.1503445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.1504095Z 2025-05-07T19:55:03.1505686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1507570Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1508140Z ^ 2025-05-07T19:55:03.1508430Z 2025-05-07T19:55:03.1510036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1511991Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1512524Z ^ 2025-05-07T19:55:03.1512799Z 2025-05-07T19:55:03.1514310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1516227Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1517185Z ^ 2025-05-07T19:55:03.1517510Z 2025-05-07T19:55:03.1519049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1521194Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.1521926Z ^ 2025-05-07T19:55:03.1522237Z 2025-05-07T19:55:03.1522623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.1523252Z 2025-05-07T19:55:03.1524758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1526713Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1527328Z ^ 2025-05-07T19:55:03.1527632Z 2025-05-07T19:55:03.1529192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1531202Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1531741Z ^ 2025-05-07T19:55:03.1532007Z 2025-05-07T19:55:03.1533506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1535420Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1535953Z ^ 2025-05-07T19:55:03.1536257Z 2025-05-07T19:55:03.1538154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1540304Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.1541038Z ^ 2025-05-07T19:55:03.1541353Z 2025-05-07T19:55:03.1541779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.1542429Z 2025-05-07T19:55:03.1543972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1545891Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1546429Z ^ 2025-05-07T19:55:03.1546696Z 2025-05-07T19:55:03.1548166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1550205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1550758Z ^ 2025-05-07T19:55:03.1551025Z 2025-05-07T19:55:03.1552534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1554456Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1555007Z ^ 2025-05-07T19:55:03.1555307Z 2025-05-07T19:55:03.1556796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1558901Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.1559926Z ^ 2025-05-07T19:55:03.1560217Z 2025-05-07T19:55:03.1560632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.1561275Z 2025-05-07T19:55:03.1562825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1564726Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1565285Z ^ 2025-05-07T19:55:03.1565568Z 2025-05-07T19:55:03.1566983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1568835Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1569358Z ^ 2025-05-07T19:55:03.1569673Z 2025-05-07T19:55:03.1571128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1573008Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1573528Z ^ 2025-05-07T19:55:03.1573821Z 2025-05-07T19:55:03.1575320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1577364Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.1578078Z ^ 2025-05-07T19:55:03.1578347Z 2025-05-07T19:55:03.1578781Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.1579260Z 2025-05-07T19:55:03.1581942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1583849Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1584655Z ^ 2025-05-07T19:55:03.1584940Z 2025-05-07T19:55:03.1586321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1587967Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1588368Z ^ 2025-05-07T19:55:03.1588601Z 2025-05-07T19:55:03.1590076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1591690Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.1592117Z ^ 2025-05-07T19:55:03.1592410Z 2025-05-07T19:55:03.6996947Z [218/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:04.0163953Z [219/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:05.3359830Z [220/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:07.8731592Z [221/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:08.5461030Z [222/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:09.3294488Z [223/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:10.0331577Z [224/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:10.3601421Z [225/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:10.3972608Z [226/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:10.4200229Z [227/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:11.5046435Z [228/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:11.6059393Z [229/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:12.3384326Z [230/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T19:55:13.0328259Z [231/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:13.2212534Z [232/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:13.9332637Z [233/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:14.6261627Z [234/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:14.7359654Z [235/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:14.7384991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7387512Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.7388260Z ^ 2025-05-07T19:55:14.7388571Z 2025-05-07T19:55:14.7389021Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.7389949Z 2025-05-07T19:55:14.7391391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7393267Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7393862Z ^ 2025-05-07T19:55:14.7394161Z 2025-05-07T19:55:14.7395740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7397863Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7398439Z ^ 2025-05-07T19:55:14.7398721Z 2025-05-07T19:55:14.7400494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7402523Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7403060Z ^ 2025-05-07T19:55:14.7403366Z 2025-05-07T19:55:14.7405040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7407247Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.7408020Z ^ 2025-05-07T19:55:14.7408309Z 2025-05-07T19:55:14.7408976Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.7409645Z 2025-05-07T19:55:14.7411260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7413394Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7414000Z ^ 2025-05-07T19:55:14.7414303Z 2025-05-07T19:55:14.7416013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7418127Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7418729Z ^ 2025-05-07T19:55:14.7419023Z 2025-05-07T19:55:14.7420698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7422826Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7423396Z ^ 2025-05-07T19:55:14.7423684Z 2025-05-07T19:55:14.7425276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7427449Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.7428150Z ^ 2025-05-07T19:55:14.7428454Z 2025-05-07T19:55:14.7428864Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.7429721Z 2025-05-07T19:55:14.7431351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7433658Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7434274Z ^ 2025-05-07T19:55:14.7434578Z 2025-05-07T19:55:14.7436290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7438372Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7438973Z ^ 2025-05-07T19:55:14.7439267Z 2025-05-07T19:55:14.7440822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7442850Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7443419Z ^ 2025-05-07T19:55:14.7443698Z 2025-05-07T19:55:14.7445229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7447314Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.7448027Z ^ 2025-05-07T19:55:14.7448350Z 2025-05-07T19:55:14.7448797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.7449424Z 2025-05-07T19:55:14.7451125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7453240Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7454135Z ^ 2025-05-07T19:55:14.7454455Z 2025-05-07T19:55:14.7456205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7458361Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7458959Z ^ 2025-05-07T19:55:14.7459255Z 2025-05-07T19:55:14.7460987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7462956Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7463501Z ^ 2025-05-07T19:55:14.7463722Z 2025-05-07T19:55:14.7464994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7467090Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:14.7467772Z ^ 2025-05-07T19:55:14.7468050Z 2025-05-07T19:55:14.7468483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:14.7469184Z 2025-05-07T19:55:14.7470851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7472878Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7473461Z ^ 2025-05-07T19:55:14.7473745Z 2025-05-07T19:55:14.7475413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7477592Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7478179Z ^ 2025-05-07T19:55:14.7478475Z 2025-05-07T19:55:14.7480043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:14.7481968Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:14.7482522Z ^ 2025-05-07T19:55:14.7482835Z 2025-05-07T19:55:15.7634501Z [236/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:16.3621162Z [237/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:17.3707724Z [238/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:18.0818146Z [239/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:24.3517116Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:24.3541764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3543981Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:24.3544759Z ^ 2025-05-07T19:55:24.3545038Z 2025-05-07T19:55:24.3545483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:24.3546127Z 2025-05-07T19:55:24.3547694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3549815Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3550354Z ^ 2025-05-07T19:55:24.3550646Z 2025-05-07T19:55:24.3552243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3554240Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3554771Z ^ 2025-05-07T19:55:24.3555037Z 2025-05-07T19:55:24.3556647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3558563Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3559114Z ^ 2025-05-07T19:55:24.3559365Z 2025-05-07T19:55:24.3560920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3563336Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:24.3564053Z ^ 2025-05-07T19:55:24.3564321Z 2025-05-07T19:55:24.3564745Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:24.3565427Z 2025-05-07T19:55:24.3567109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3569098Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3569630Z ^ 2025-05-07T19:55:24.3569914Z 2025-05-07T19:55:24.3571512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3573525Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3574053Z ^ 2025-05-07T19:55:24.3574327Z 2025-05-07T19:55:24.3575833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3577780Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3578307Z ^ 2025-05-07T19:55:24.3578581Z 2025-05-07T19:55:24.3580158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3582308Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:24.3583048Z ^ 2025-05-07T19:55:24.3583544Z 2025-05-07T19:55:24.3583979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:24.3584949Z 2025-05-07T19:55:24.3586540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3588520Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3589065Z ^ 2025-05-07T19:55:24.3589334Z 2025-05-07T19:55:24.3590992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3593017Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3593543Z ^ 2025-05-07T19:55:24.3593821Z 2025-05-07T19:55:24.3595461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3597414Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3597958Z ^ 2025-05-07T19:55:24.3598221Z 2025-05-07T19:55:24.3599849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3602031Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:24.3602745Z ^ 2025-05-07T19:55:24.3603008Z 2025-05-07T19:55:24.3603428Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:24.3604105Z 2025-05-07T19:55:24.3605941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3607970Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3608522Z ^ 2025-05-07T19:55:24.3608828Z 2025-05-07T19:55:24.3610385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3612364Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3612887Z ^ 2025-05-07T19:55:24.3613160Z 2025-05-07T19:55:24.3614762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3616775Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3617300Z ^ 2025-05-07T19:55:24.3617569Z 2025-05-07T19:55:24.3619141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3621312Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:24.3622055Z ^ 2025-05-07T19:55:24.3622340Z 2025-05-07T19:55:24.3622800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:24.3623480Z 2025-05-07T19:55:24.3625003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3627269Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3627687Z ^ 2025-05-07T19:55:24.3627907Z 2025-05-07T19:55:24.3629019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3630951Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3631494Z ^ 2025-05-07T19:55:24.3631726Z 2025-05-07T19:55:24.3633038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.3634932Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:24.3635493Z ^ 2025-05-07T19:55:24.3635760Z 2025-05-07T19:55:36.9639120Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:36.9663937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9666176Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9666762Z ^ 2025-05-07T19:55:36.9667372Z 2025-05-07T19:55:36.9667840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:36.9668515Z 2025-05-07T19:55:36.9670344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9672452Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9673022Z ^ 2025-05-07T19:55:36.9673328Z 2025-05-07T19:55:36.9675006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9677102Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9677667Z ^ 2025-05-07T19:55:36.9677966Z 2025-05-07T19:55:36.9679603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9681656Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9682125Z ^ 2025-05-07T19:55:36.9682378Z 2025-05-07T19:55:36.9682820Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:36.9683499Z 2025-05-07T19:55:36.9685411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9687553Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9688141Z ^ 2025-05-07T19:55:36.9688443Z 2025-05-07T19:55:36.9690454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9692590Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9693155Z ^ 2025-05-07T19:55:36.9693475Z 2025-05-07T19:55:36.9694923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9697013Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9697573Z ^ 2025-05-07T19:55:36.9697894Z 2025-05-07T19:55:36.9698355Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:36.9699048Z 2025-05-07T19:55:36.9700748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9702884Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9703484Z ^ 2025-05-07T19:55:36.9703790Z 2025-05-07T19:55:36.9705466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9707568Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9708148Z ^ 2025-05-07T19:55:36.9708461Z 2025-05-07T19:55:36.9710290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9712640Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9713201Z ^ 2025-05-07T19:55:36.9713516Z 2025-05-07T19:55:36.9713971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:36.9714656Z 2025-05-07T19:55:36.9716345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9718411Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9718959Z ^ 2025-05-07T19:55:36.9719255Z 2025-05-07T19:55:36.9720933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9722614Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9723176Z ^ 2025-05-07T19:55:36.9723468Z 2025-05-07T19:55:36.9725013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9726987Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9727528Z ^ 2025-05-07T19:55:36.9727816Z 2025-05-07T19:55:36.9728245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:36.9728901Z 2025-05-07T19:55:36.9730473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9732471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9733044Z ^ 2025-05-07T19:55:36.9733569Z 2025-05-07T19:55:36.9735178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:36.9737157Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:36.9737722Z ^ 2025-05-07T19:55:36.9738016Z 2025-05-07T19:55:39.6930275Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:50.9956772Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:55:58.7041513Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:58.7070159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7072297Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7073262Z ^ 2025-05-07T19:55:58.7076895Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7080273Z 2025-05-07T19:55:58.7080772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.7081470Z 2025-05-07T19:55:58.7082822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7085171Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7086117Z ^ 2025-05-07T19:55:58.7089251Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7092932Z 2025-05-07T19:55:58.7094325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7096430Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7097380Z ^ 2025-05-07T19:55:58.7100900Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7104192Z 2025-05-07T19:55:58.7105571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7107695Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7108636Z ^ 2025-05-07T19:55:58.7112398Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7115766Z 2025-05-07T19:55:58.7119693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7121955Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7122791Z ^ 2025-05-07T19:55:58.7126332Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7129352Z 2025-05-07T19:55:58.7130657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7132682Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7133606Z ^ 2025-05-07T19:55:58.7136517Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7138927Z 2025-05-07T19:55:58.7140272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7141843Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7142530Z ^ 2025-05-07T19:55:58.7145572Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.7148529Z 2025-05-07T19:55:58.7149854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7151577Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7152379Z ^ 2025-05-07T19:55:58.7155577Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.7158842Z 2025-05-07T19:55:58.7160411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7162195Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7163070Z ^ 2025-05-07T19:55:58.7166379Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.7169392Z 2025-05-07T19:55:58.7170722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7172773Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7193388Z ^ 2025-05-07T19:55:58.7197165Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.7200525Z 2025-05-07T19:55:58.7201924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7204534Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7205505Z ^ 2025-05-07T19:55:58.7209053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.7212494Z 2025-05-07T19:55:58.7214042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7216182Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7217119Z ^ 2025-05-07T19:55:58.7220736Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.7224198Z 2025-05-07T19:55:58.7225579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7228078Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7228964Z ^ 2025-05-07T19:55:58.7232630Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.7235971Z 2025-05-07T19:55:58.7237388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7239544Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7240466Z ^ 2025-05-07T19:55:58.7244115Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.7247410Z 2025-05-07T19:55:58.7248750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7250857Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7252039Z ^ 2025-05-07T19:55:58.7255748Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.7259166Z 2025-05-07T19:55:58.7260522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7262626Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7263587Z ^ 2025-05-07T19:55:58.7267239Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.7270705Z 2025-05-07T19:55:58.7272059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7274296Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7275462Z ^ 2025-05-07T19:55:58.7278981Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.7282378Z 2025-05-07T19:55:58.7283808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7286140Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7287094Z ^ 2025-05-07T19:55:58.7290713Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.7294127Z 2025-05-07T19:55:58.7295548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7297667Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7298663Z ^ 2025-05-07T19:55:58.7302693Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.7306177Z 2025-05-07T19:55:58.7307566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7309862Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7310766Z ^ 2025-05-07T19:55:58.7314433Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.7317922Z 2025-05-07T19:55:58.7319345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7321542Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7322537Z ^ 2025-05-07T19:55:58.7326421Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.7329855Z 2025-05-07T19:55:58.7331227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7333357Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7334315Z ^ 2025-05-07T19:55:58.7338050Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.7341467Z 2025-05-07T19:55:58.7342997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7345098Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7346017Z ^ 2025-05-07T19:55:58.7350084Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.7353721Z 2025-05-07T19:55:58.7355303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7357462Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7358329Z ^ 2025-05-07T19:55:58.7361993Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.7365490Z 2025-05-07T19:55:58.7366882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7368944Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7369875Z ^ 2025-05-07T19:55:58.7373700Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7377165Z 2025-05-07T19:55:58.7377657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.7378361Z 2025-05-07T19:55:58.7379766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7381803Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7382753Z ^ 2025-05-07T19:55:58.7386691Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7390188Z 2025-05-07T19:55:58.7391589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7393652Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7394605Z ^ 2025-05-07T19:55:58.7398357Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7401879Z 2025-05-07T19:55:58.7403277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7405354Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7406438Z ^ 2025-05-07T19:55:58.7410171Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7413912Z 2025-05-07T19:55:58.7415335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7417511Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7418395Z ^ 2025-05-07T19:55:58.7422377Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7425662Z 2025-05-07T19:55:58.7427048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7429131Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7430219Z ^ 2025-05-07T19:55:58.7433857Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7437174Z 2025-05-07T19:55:58.7438707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7440799Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7441752Z ^ 2025-05-07T19:55:58.7445229Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.7448782Z 2025-05-07T19:55:58.7450153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7452265Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7453216Z ^ 2025-05-07T19:55:58.7456766Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.7460056Z 2025-05-07T19:55:58.7461399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7463484Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7464453Z ^ 2025-05-07T19:55:58.7468471Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.7472113Z 2025-05-07T19:55:58.7473480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7475634Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7476731Z ^ 2025-05-07T19:55:58.7480513Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.7483783Z 2025-05-07T19:55:58.7485568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7487716Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7488681Z ^ 2025-05-07T19:55:58.7492366Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.7496127Z 2025-05-07T19:55:58.7497538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7499653Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7500604Z ^ 2025-05-07T19:55:58.7504147Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.7507695Z 2025-05-07T19:55:58.7509101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7511180Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7512058Z ^ 2025-05-07T19:55:58.7516014Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.7519603Z 2025-05-07T19:55:58.7521009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7523095Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7524051Z ^ 2025-05-07T19:55:58.7527651Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.7530564Z 2025-05-07T19:55:58.7531952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7534054Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7535023Z ^ 2025-05-07T19:55:58.7538674Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.7542253Z 2025-05-07T19:55:58.7543406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7545505Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7546460Z ^ 2025-05-07T19:55:58.7550204Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.7553635Z 2025-05-07T19:55:58.7555000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7557081Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7558191Z ^ 2025-05-07T19:55:58.7561757Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.7565163Z 2025-05-07T19:55:58.7566780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7568641Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7569470Z ^ 2025-05-07T19:55:58.7572989Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.7576256Z 2025-05-07T19:55:58.7577303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7578891Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7579545Z ^ 2025-05-07T19:55:58.7582317Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.7585403Z 2025-05-07T19:55:58.7586592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7588420Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7589220Z ^ 2025-05-07T19:55:58.7592450Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.7595332Z 2025-05-07T19:55:58.7596593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7598531Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7599420Z ^ 2025-05-07T19:55:58.7602752Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.7605948Z 2025-05-07T19:55:58.7607484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7609342Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7610266Z ^ 2025-05-07T19:55:58.7613792Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.7617155Z 2025-05-07T19:55:58.7618550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7620598Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7621529Z ^ 2025-05-07T19:55:58.7624967Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.7628393Z 2025-05-07T19:55:58.7629944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7632419Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7633365Z ^ 2025-05-07T19:55:58.7637041Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.7640627Z 2025-05-07T19:55:58.7642018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7644066Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7644986Z ^ 2025-05-07T19:55:58.7648639Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7651889Z 2025-05-07T19:55:58.7652388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.7653128Z 2025-05-07T19:55:58.7654744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7656835Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7657805Z ^ 2025-05-07T19:55:58.7661348Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7664719Z 2025-05-07T19:55:58.7666118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7668172Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7669119Z ^ 2025-05-07T19:55:58.7672676Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7675962Z 2025-05-07T19:55:58.7677329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7679689Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7680641Z ^ 2025-05-07T19:55:58.7684301Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7687856Z 2025-05-07T19:55:58.7689187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7691239Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7692163Z ^ 2025-05-07T19:55:58.7695709Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7699124Z 2025-05-07T19:55:58.7700809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7702955Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7703848Z ^ 2025-05-07T19:55:58.7707502Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7710916Z 2025-05-07T19:55:58.7712289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7714411Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7715295Z ^ 2025-05-07T19:55:58.7718807Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.7722023Z 2025-05-07T19:55:58.7723394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7727138Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7728122Z ^ 2025-05-07T19:55:58.7731740Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.7735094Z 2025-05-07T19:55:58.7736481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7738633Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7739608Z ^ 2025-05-07T19:55:58.7743303Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.7746676Z 2025-05-07T19:55:58.7748091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7750645Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7751605Z ^ 2025-05-07T19:55:58.7755164Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.7758537Z 2025-05-07T19:55:58.7759890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7762017Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7763005Z ^ 2025-05-07T19:55:58.7766791Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.7770131Z 2025-05-07T19:55:58.7771477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7773603Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7774776Z ^ 2025-05-07T19:55:58.7778414Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.7781928Z 2025-05-07T19:55:58.7783249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7785636Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7786603Z ^ 2025-05-07T19:55:58.7790300Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.7793791Z 2025-05-07T19:55:58.7795228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7797311Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7798549Z ^ 2025-05-07T19:55:58.7802326Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.7805817Z 2025-05-07T19:55:58.7807250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7809403Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7810378Z ^ 2025-05-07T19:55:58.7813987Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.7817348Z 2025-05-07T19:55:58.7818794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7820963Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7821917Z ^ 2025-05-07T19:55:58.7825799Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.7829516Z 2025-05-07T19:55:58.7830896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7833147Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7834121Z ^ 2025-05-07T19:55:58.7837858Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.7841341Z 2025-05-07T19:55:58.7842752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7844853Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7845705Z ^ 2025-05-07T19:55:58.7849742Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.7853182Z 2025-05-07T19:55:58.7854616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7856825Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7857795Z ^ 2025-05-07T19:55:58.7861642Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.7865022Z 2025-05-07T19:55:58.7866448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7868620Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7869594Z ^ 2025-05-07T19:55:58.7873066Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.7876692Z 2025-05-07T19:55:58.7878049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7880148Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7881063Z ^ 2025-05-07T19:55:58.7884944Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.7888359Z 2025-05-07T19:55:58.7889544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7891570Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7892526Z ^ 2025-05-07T19:55:58.7896480Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.7900042Z 2025-05-07T19:55:58.7901403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7903403Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7904441Z ^ 2025-05-07T19:55:58.7907992Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.7911473Z 2025-05-07T19:55:58.7912845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7914996Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7915991Z ^ 2025-05-07T19:55:58.7919698Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.7923466Z 2025-05-07T19:55:58.7924861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7927056Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7928044Z ^ 2025-05-07T19:55:58.7931426Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.7934717Z 2025-05-07T19:55:58.7935192Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.7935941Z 2025-05-07T19:55:58.7937345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7939489Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7940420Z ^ 2025-05-07T19:55:58.7944274Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.7947740Z 2025-05-07T19:55:58.7949152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7951487Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7952454Z ^ 2025-05-07T19:55:58.7955986Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.7959350Z 2025-05-07T19:55:58.7960785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7962780Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7963671Z ^ 2025-05-07T19:55:58.7967325Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.7971175Z 2025-05-07T19:55:58.7972592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7974683Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7975635Z ^ 2025-05-07T19:55:58.7979227Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.7982480Z 2025-05-07T19:55:58.7983714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7986040Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7986999Z ^ 2025-05-07T19:55:58.7990998Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.7994433Z 2025-05-07T19:55:58.7995802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.7997715Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.7998670Z ^ 2025-05-07T19:55:58.8002276Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.8005670Z 2025-05-07T19:55:58.8007036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8009144Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8010084Z ^ 2025-05-07T19:55:58.8013653Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.8017328Z 2025-05-07T19:55:58.8018656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8020790Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8021744Z ^ 2025-05-07T19:55:58.8024931Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.8028152Z 2025-05-07T19:55:58.8029664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8031505Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8032232Z ^ 2025-05-07T19:55:58.8035129Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.8037709Z 2025-05-07T19:55:58.8038698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8040412Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8041248Z ^ 2025-05-07T19:55:58.8044425Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.8047233Z 2025-05-07T19:55:58.8048393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8050267Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8051156Z ^ 2025-05-07T19:55:58.8054526Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.8057745Z 2025-05-07T19:55:58.8059081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8061011Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8061927Z ^ 2025-05-07T19:55:58.8065164Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.8068447Z 2025-05-07T19:55:58.8069964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8072040Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8072980Z ^ 2025-05-07T19:55:58.8076466Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.8080085Z 2025-05-07T19:55:58.8081506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8083630Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8084812Z ^ 2025-05-07T19:55:58.8088397Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.8091884Z 2025-05-07T19:55:58.8093316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8095455Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8096386Z ^ 2025-05-07T19:55:58.8099984Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.8103740Z 2025-05-07T19:55:58.8105049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8107211Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8108171Z ^ 2025-05-07T19:55:58.8111963Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.8115376Z 2025-05-07T19:55:58.8116803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8118886Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8119829Z ^ 2025-05-07T19:55:58.8123526Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.8126786Z 2025-05-07T19:55:58.8128431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8130528Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8131461Z ^ 2025-05-07T19:55:58.8135218Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.8138655Z 2025-05-07T19:55:58.8140028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8142085Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8142979Z ^ 2025-05-07T19:55:58.8146649Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.8150292Z 2025-05-07T19:55:58.8151747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8154104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8155034Z ^ 2025-05-07T19:55:58.8158732Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.8162035Z 2025-05-07T19:55:58.8163429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8165546Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8166443Z ^ 2025-05-07T19:55:58.8170109Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.8173414Z 2025-05-07T19:55:58.8174998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8177141Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8178128Z ^ 2025-05-07T19:55:58.8181787Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.8185549Z 2025-05-07T19:55:58.8186936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8189058Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8190123Z ^ 2025-05-07T19:55:58.8193853Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.8197321Z 2025-05-07T19:55:58.8198735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8201178Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8202151Z ^ 2025-05-07T19:55:58.8205705Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:58.8209077Z 2025-05-07T19:55:58.8209540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:58.8210263Z 2025-05-07T19:55:58.8211713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8213878Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8214875Z ^ 2025-05-07T19:55:58.8218492Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:58.8221844Z 2025-05-07T19:55:58.8223537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8225718Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8226676Z ^ 2025-05-07T19:55:58.8230412Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:58.8233804Z 2025-05-07T19:55:58.8235174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8237349Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8238325Z ^ 2025-05-07T19:55:58.8242031Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:58.8245432Z 2025-05-07T19:55:58.8246845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8249125Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8250099Z ^ 2025-05-07T19:55:58.8253798Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:58.8257131Z 2025-05-07T19:55:58.8258513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8260566Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8261551Z ^ 2025-05-07T19:55:58.8265328Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:58.8268690Z 2025-05-07T19:55:58.8270166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8272513Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8273483Z ^ 2025-05-07T19:55:58.8277095Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:58.8280528Z 2025-05-07T19:55:58.8281876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8284082Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8285286Z ^ 2025-05-07T19:55:58.8288855Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:58.8292248Z 2025-05-07T19:55:58.8293583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8295728Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8297066Z ^ 2025-05-07T19:55:58.8300837Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:58.8304388Z 2025-05-07T19:55:58.8305790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8307813Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8308799Z ^ 2025-05-07T19:55:58.8312492Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:58.8315783Z 2025-05-07T19:55:58.8317143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8319253Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8321778Z ^ 2025-05-07T19:55:58.8325370Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:58.8328740Z 2025-05-07T19:55:58.8330040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8332081Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8333030Z ^ 2025-05-07T19:55:58.8336650Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:58.8340051Z 2025-05-07T19:55:58.8341442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8343471Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8344560Z ^ 2025-05-07T19:55:58.8348058Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:58.8351528Z 2025-05-07T19:55:58.8352922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8355104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8356091Z ^ 2025-05-07T19:55:58.8359762Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:58.8363189Z 2025-05-07T19:55:58.8364633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8366947Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8367910Z ^ 2025-05-07T19:55:58.8371532Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:58.8374995Z 2025-05-07T19:55:58.8376410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8378525Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8379508Z ^ 2025-05-07T19:55:58.8383240Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:58.8386946Z 2025-05-07T19:55:58.8388343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8390416Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8391345Z ^ 2025-05-07T19:55:58.8394988Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:58.8398498Z 2025-05-07T19:55:58.8399884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8402000Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8402947Z ^ 2025-05-07T19:55:58.8406677Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:58.8410118Z 2025-05-07T19:55:58.8411489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8413729Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8414544Z ^ 2025-05-07T19:55:58.8418182Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:58.8421661Z 2025-05-07T19:55:58.8423037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8425144Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8426068Z ^ 2025-05-07T19:55:58.8429762Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:58.8433336Z 2025-05-07T19:55:58.8434727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8436805Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8437764Z ^ 2025-05-07T19:55:58.8441404Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:58.8444950Z 2025-05-07T19:55:58.8446296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8448365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8449310Z ^ 2025-05-07T19:55:58.8452923Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:58.8456117Z 2025-05-07T19:55:58.8457443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8459506Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8460350Z ^ 2025-05-07T19:55:58.8463369Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:58.8465911Z 2025-05-07T19:55:58.8466949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:58.8468435Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:58.8469255Z ^ 2025-05-07T19:55:58.8472558Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:58.8475498Z 2025-05-07T19:56:03.6569488Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:03.6596527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6598713Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6599259Z ^ 2025-05-07T19:56:03.6599595Z 2025-05-07T19:56:03.6600018Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.6600679Z 2025-05-07T19:56:03.6602325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6604459Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6604952Z ^ 2025-05-07T19:56:03.6605244Z 2025-05-07T19:56:03.6606896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6609001Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6609585Z ^ 2025-05-07T19:56:03.6609879Z 2025-05-07T19:56:03.6611469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6613495Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6613993Z ^ 2025-05-07T19:56:03.6614292Z 2025-05-07T19:56:03.6614739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.6615390Z 2025-05-07T19:56:03.6617047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6619420Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6620036Z ^ 2025-05-07T19:56:03.6620349Z 2025-05-07T19:56:03.6622061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6624287Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6624895Z ^ 2025-05-07T19:56:03.6625209Z 2025-05-07T19:56:03.6626889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6629082Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6629827Z ^ 2025-05-07T19:56:03.6630142Z 2025-05-07T19:56:03.6630587Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.6631308Z 2025-05-07T19:56:03.6633015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6635072Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6635620Z ^ 2025-05-07T19:56:03.6635943Z 2025-05-07T19:56:03.6637569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6639703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6640541Z ^ 2025-05-07T19:56:03.6640851Z 2025-05-07T19:56:03.6642534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6644654Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6645279Z ^ 2025-05-07T19:56:03.6645597Z 2025-05-07T19:56:03.6646071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.6646757Z 2025-05-07T19:56:03.6648347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6650511Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6651113Z ^ 2025-05-07T19:56:03.6651444Z 2025-05-07T19:56:03.6653165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6655309Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6655915Z ^ 2025-05-07T19:56:03.6656200Z 2025-05-07T19:56:03.6657852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6660019Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6660613Z ^ 2025-05-07T19:56:03.6660943Z 2025-05-07T19:56:03.6661399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.6662362Z 2025-05-07T19:56:03.6663882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6665977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6666549Z ^ 2025-05-07T19:56:03.6666885Z 2025-05-07T19:56:03.6668503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:03.6670720Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:03.6671285Z ^ 2025-05-07T19:56:03.6671579Z 2025-05-07T19:56:06.9276033Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:56:07.4872039Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:22.9401093Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:22.9427080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9429564Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:22.9430395Z ^ 2025-05-07T19:56:22.9430735Z 2025-05-07T19:56:22.9431260Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:22.9431966Z 2025-05-07T19:56:22.9433607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9435753Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9436340Z ^ 2025-05-07T19:56:22.9436674Z 2025-05-07T19:56:22.9438426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9440377Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9440917Z ^ 2025-05-07T19:56:22.9441210Z 2025-05-07T19:56:22.9442990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9445095Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9445668Z ^ 2025-05-07T19:56:22.9445986Z 2025-05-07T19:56:22.9447613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9449865Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:22.9450603Z ^ 2025-05-07T19:56:22.9450933Z 2025-05-07T19:56:22.9451400Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:22.9452077Z 2025-05-07T19:56:22.9453738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9455830Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9456434Z ^ 2025-05-07T19:56:22.9456734Z 2025-05-07T19:56:22.9458289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9460403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9461009Z ^ 2025-05-07T19:56:22.9461304Z 2025-05-07T19:56:22.9462971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9465312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9465905Z ^ 2025-05-07T19:56:22.9466221Z 2025-05-07T19:56:22.9467907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9470402Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:22.9471202Z ^ 2025-05-07T19:56:22.9471536Z 2025-05-07T19:56:22.9472019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:22.9472737Z 2025-05-07T19:56:22.9474420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9476599Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9477228Z ^ 2025-05-07T19:56:22.9477535Z 2025-05-07T19:56:22.9479216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9481827Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9482314Z ^ 2025-05-07T19:56:22.9482618Z 2025-05-07T19:56:22.9484089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9486379Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9486971Z ^ 2025-05-07T19:56:22.9487290Z 2025-05-07T19:56:22.9489223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9491567Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:22.9492381Z ^ 2025-05-07T19:56:22.9492687Z 2025-05-07T19:56:22.9493189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:22.9493908Z 2025-05-07T19:56:22.9495664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9497351Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9497868Z ^ 2025-05-07T19:56:22.9498128Z 2025-05-07T19:56:22.9499450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9501235Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9501806Z ^ 2025-05-07T19:56:22.9502110Z 2025-05-07T19:56:22.9503689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9505715Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9506270Z ^ 2025-05-07T19:56:22.9506583Z 2025-05-07T19:56:22.9508158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9510752Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:22.9511510Z ^ 2025-05-07T19:56:22.9511792Z 2025-05-07T19:56:22.9512261Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:22.9512920Z 2025-05-07T19:56:22.9514450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9516403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9516975Z ^ 2025-05-07T19:56:22.9517252Z 2025-05-07T19:56:22.9518777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9520900Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9521469Z ^ 2025-05-07T19:56:22.9521774Z 2025-05-07T19:56:22.9523293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:22.9525248Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:22.9525788Z ^ 2025-05-07T19:56:22.9526064Z 2025-05-07T19:56:23.2598052Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:23.2610858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2612055Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:23.2612517Z ^ 2025-05-07T19:56:23.2612689Z 2025-05-07T19:56:23.2613007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:23.2613376Z 2025-05-07T19:56:23.2614230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2615287Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2615650Z ^ 2025-05-07T19:56:23.2615816Z 2025-05-07T19:56:23.2616667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2617724Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2618062Z ^ 2025-05-07T19:56:23.2618223Z 2025-05-07T19:56:23.2619053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2620134Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2620478Z ^ 2025-05-07T19:56:23.2620638Z 2025-05-07T19:56:23.2621538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2622725Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:23.2623183Z ^ 2025-05-07T19:56:23.2623351Z 2025-05-07T19:56:23.2623611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:23.2623980Z 2025-05-07T19:56:23.2624834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2625881Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2626232Z ^ 2025-05-07T19:56:23.2626400Z 2025-05-07T19:56:23.2627255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2628307Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2628647Z ^ 2025-05-07T19:56:23.2628806Z 2025-05-07T19:56:23.2629799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2630889Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2631245Z ^ 2025-05-07T19:56:23.2631409Z 2025-05-07T19:56:23.2632236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2633423Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:23.2633953Z ^ 2025-05-07T19:56:23.2634145Z 2025-05-07T19:56:23.2634402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:23.2634774Z 2025-05-07T19:56:23.2635620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2636691Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2637048Z ^ 2025-05-07T19:56:23.2637217Z 2025-05-07T19:56:23.2638080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2639135Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2639493Z ^ 2025-05-07T19:56:23.2639667Z 2025-05-07T19:56:23.2640493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2641572Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2641920Z ^ 2025-05-07T19:56:23.2642087Z 2025-05-07T19:56:23.2642910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2644073Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:23.2644504Z ^ 2025-05-07T19:56:23.2644688Z 2025-05-07T19:56:23.2644944Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:23.2645326Z 2025-05-07T19:56:23.2646238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2647292Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2647642Z ^ 2025-05-07T19:56:23.2647809Z 2025-05-07T19:56:23.2648655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2649695Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2650036Z ^ 2025-05-07T19:56:23.2650194Z 2025-05-07T19:56:23.2651019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2652096Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2652436Z ^ 2025-05-07T19:56:23.2652586Z 2025-05-07T19:56:23.2653401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2654562Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:23.2654986Z ^ 2025-05-07T19:56:23.2655172Z 2025-05-07T19:56:23.2655427Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:23.2655788Z 2025-05-07T19:56:23.2656633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2657752Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2658092Z ^ 2025-05-07T19:56:23.2658258Z 2025-05-07T19:56:23.2659083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2660151Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2660489Z ^ 2025-05-07T19:56:23.2660656Z 2025-05-07T19:56:23.2661477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:23.2662548Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:23.2662878Z ^ 2025-05-07T19:56:23.2663060Z 2025-05-07T19:56:24.7954016Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:24.7970309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7971502Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.7972107Z ^ 2025-05-07T19:56:24.7972279Z 2025-05-07T19:56:24.7972533Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.7972901Z 2025-05-07T19:56:24.7973769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7974834Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7975359Z ^ 2025-05-07T19:56:24.7975528Z 2025-05-07T19:56:24.7976480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7977595Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7977944Z ^ 2025-05-07T19:56:24.7978109Z 2025-05-07T19:56:24.7978961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7980024Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7980360Z ^ 2025-05-07T19:56:24.7980515Z 2025-05-07T19:56:24.7981347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7982540Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.7982992Z ^ 2025-05-07T19:56:24.7983160Z 2025-05-07T19:56:24.7983413Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.7983811Z 2025-05-07T19:56:24.7985170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7986297Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7986630Z ^ 2025-05-07T19:56:24.7986798Z 2025-05-07T19:56:24.7987663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7988737Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7989087Z ^ 2025-05-07T19:56:24.7989248Z 2025-05-07T19:56:24.7990227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7991310Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7991649Z ^ 2025-05-07T19:56:24.7991810Z 2025-05-07T19:56:24.7992641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7993833Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.7994294Z ^ 2025-05-07T19:56:24.7994576Z 2025-05-07T19:56:24.7994832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.7995225Z 2025-05-07T19:56:24.7996066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7997189Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7997536Z ^ 2025-05-07T19:56:24.7997699Z 2025-05-07T19:56:24.7998556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.7999615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.7999961Z ^ 2025-05-07T19:56:24.8000130Z 2025-05-07T19:56:24.8000986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8002046Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8002397Z ^ 2025-05-07T19:56:24.8002556Z 2025-05-07T19:56:24.8003392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8004701Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.8005146Z ^ 2025-05-07T19:56:24.8005308Z 2025-05-07T19:56:24.8005560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.8005925Z 2025-05-07T19:56:24.8006788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8007855Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8008247Z ^ 2025-05-07T19:56:24.8008413Z 2025-05-07T19:56:24.8009316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8010407Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8010755Z ^ 2025-05-07T19:56:24.8010919Z 2025-05-07T19:56:24.8011761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8012856Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8013179Z ^ 2025-05-07T19:56:24.8013374Z 2025-05-07T19:56:24.8014209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8015410Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.8015845Z ^ 2025-05-07T19:56:24.8016040Z 2025-05-07T19:56:24.8016297Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.8016666Z 2025-05-07T19:56:24.8017534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8018653Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8019014Z ^ 2025-05-07T19:56:24.8019183Z 2025-05-07T19:56:24.8020023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8021151Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8021486Z ^ 2025-05-07T19:56:24.8021648Z 2025-05-07T19:56:24.8022480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.8023558Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.8023875Z ^ 2025-05-07T19:56:24.8024059Z 2025-05-07T19:56:32.5066858Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:32.5093230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5095491Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.5096289Z ^ 2025-05-07T19:56:32.5096739Z 2025-05-07T19:56:32.5097222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.5097922Z 2025-05-07T19:56:32.5099615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5101815Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5120709Z ^ 2025-05-07T19:56:32.5121220Z 2025-05-07T19:56:32.5122798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5124908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5125493Z ^ 2025-05-07T19:56:32.5125785Z 2025-05-07T19:56:32.5127409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5129542Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5130128Z ^ 2025-05-07T19:56:32.5130433Z 2025-05-07T19:56:32.5132105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5134381Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.5135206Z ^ 2025-05-07T19:56:32.5135501Z 2025-05-07T19:56:32.5135973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.5136716Z 2025-05-07T19:56:32.5138383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5140542Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5141135Z ^ 2025-05-07T19:56:32.5141462Z 2025-05-07T19:56:32.5143425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5145382Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5145883Z ^ 2025-05-07T19:56:32.5146141Z 2025-05-07T19:56:32.5147410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5149002Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5149635Z ^ 2025-05-07T19:56:32.5149863Z 2025-05-07T19:56:32.5151122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5152891Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.5153592Z ^ 2025-05-07T19:56:32.5153847Z 2025-05-07T19:56:32.5154213Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.5154834Z 2025-05-07T19:56:32.5156169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5158019Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5158455Z ^ 2025-05-07T19:56:32.5158687Z 2025-05-07T19:56:32.5160198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5161985Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5162471Z ^ 2025-05-07T19:56:32.5162738Z 2025-05-07T19:56:32.5164086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5165834Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5166302Z ^ 2025-05-07T19:56:32.5166536Z 2025-05-07T19:56:32.5167728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5169439Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.5170119Z ^ 2025-05-07T19:56:32.5170350Z 2025-05-07T19:56:32.5170721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.5171296Z 2025-05-07T19:56:32.5172581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5174224Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5174705Z ^ 2025-05-07T19:56:32.5174937Z 2025-05-07T19:56:32.5176175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5177707Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5178190Z ^ 2025-05-07T19:56:32.5178413Z 2025-05-07T19:56:32.5179846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5181412Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5181885Z ^ 2025-05-07T19:56:32.5182105Z 2025-05-07T19:56:32.5183331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5185469Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.5186084Z ^ 2025-05-07T19:56:32.5186330Z 2025-05-07T19:56:32.5186708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.5187316Z 2025-05-07T19:56:32.5188745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5190658Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5191119Z ^ 2025-05-07T19:56:32.5191365Z 2025-05-07T19:56:32.5192683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5194543Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5195011Z ^ 2025-05-07T19:56:32.5195277Z 2025-05-07T19:56:32.5196736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.5198522Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.5199004Z ^ 2025-05-07T19:56:32.5199265Z 2025-05-07T19:57:04.8278937Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:05.7992896Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:57:08.0434260Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:10.2992257Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:57:11.7411273Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:12.7929078Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:14.1025694Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:15.3568971Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:18.2536449Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:18.5217723Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:27.0485631Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:27.5430689Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:32.3260194Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:33.4856625Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:39.6808213Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:39.6829740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:39.6831328Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:39.6831757Z ^ 2025-05-07T19:57:39.6831981Z 2025-05-07T19:57:39.6832407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6833108Z 2025-05-07T19:57:39.6834431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:39.6836500Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:39.6837011Z ^ 2025-05-07T19:57:39.6837281Z 2025-05-07T19:57:39.6837680Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6838260Z 2025-05-07T19:57:39.6839687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:39.6841370Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:39.6841908Z ^ 2025-05-07T19:57:39.6842173Z 2025-05-07T19:57:39.6842602Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6843227Z 2025-05-07T19:57:39.6844265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:39.6845879Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:39.6846369Z ^ 2025-05-07T19:57:39.6846584Z 2025-05-07T19:57:39.6846935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.6847495Z 2025-05-07T19:57:45.7005454Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:47.2329340Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:49.2033399Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:49.6862978Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:51.5206142Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:51.5230491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5232557Z int error_code = 0; 2025-05-07T19:57:51.5233018Z ^ 2025-05-07T19:57:51.5233238Z 2025-05-07T19:57:51.5233695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.5234396Z 2025-05-07T19:57:51.5235902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5237760Z int64_t error_value; 2025-05-07T19:57:51.5238246Z ^ 2025-05-07T19:57:51.5238484Z 2025-05-07T19:57:51.5239975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5241834Z int error_code = 0; 2025-05-07T19:57:51.5242302Z ^ 2025-05-07T19:57:51.5242516Z 2025-05-07T19:57:51.5244021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5245892Z int64_t error_value; 2025-05-07T19:57:51.5246350Z ^ 2025-05-07T19:57:51.5246610Z 2025-05-07T19:57:51.5248081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5250057Z int error_code = 0; 2025-05-07T19:57:51.5250503Z ^ 2025-05-07T19:57:51.5250748Z 2025-05-07T19:57:51.5252231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5254226Z int64_t error_value; 2025-05-07T19:57:51.5254690Z ^ 2025-05-07T19:57:51.5254919Z 2025-05-07T19:57:51.5256383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5258240Z int error_code = 0; 2025-05-07T19:57:51.5258705Z ^ 2025-05-07T19:57:51.5258926Z 2025-05-07T19:57:51.5260375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5262261Z int64_t error_value; 2025-05-07T19:57:51.5262736Z ^ 2025-05-07T19:57:51.5262968Z 2025-05-07T19:57:51.5264417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5266289Z int error_code = 0; 2025-05-07T19:57:51.5266727Z ^ 2025-05-07T19:57:51.5266925Z 2025-05-07T19:57:51.5267310Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.5267890Z 2025-05-07T19:57:51.5269199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5271151Z int64_t error_value; 2025-05-07T19:57:51.5271625Z ^ 2025-05-07T19:57:51.5271877Z 2025-05-07T19:57:51.5273103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5274716Z int error_code = 0; 2025-05-07T19:57:51.5275152Z ^ 2025-05-07T19:57:51.5275571Z 2025-05-07T19:57:51.5277027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5278916Z int64_t error_value; 2025-05-07T19:57:51.5279364Z ^ 2025-05-07T19:57:51.5279634Z 2025-05-07T19:57:51.5281085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5282946Z int error_code = 0; 2025-05-07T19:57:51.5283400Z ^ 2025-05-07T19:57:51.5283624Z 2025-05-07T19:57:51.5285342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5287213Z int64_t error_value; 2025-05-07T19:57:51.5287671Z ^ 2025-05-07T19:57:51.5287906Z 2025-05-07T19:57:51.5289378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5291232Z int error_code = 0; 2025-05-07T19:57:51.5291683Z ^ 2025-05-07T19:57:51.5291888Z 2025-05-07T19:57:51.5293343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5295362Z int64_t error_value; 2025-05-07T19:57:51.5295819Z ^ 2025-05-07T19:57:51.5296050Z 2025-05-07T19:57:51.5297466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5299309Z int error_code = 0; 2025-05-07T19:57:51.5299666Z ^ 2025-05-07T19:57:51.5299865Z 2025-05-07T19:57:51.5300282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.5300939Z 2025-05-07T19:57:51.5302420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5304284Z int64_t error_value; 2025-05-07T19:57:51.5304747Z ^ 2025-05-07T19:57:51.5304985Z 2025-05-07T19:57:51.5306470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5308377Z int error_code = 0; 2025-05-07T19:57:51.5308867Z ^ 2025-05-07T19:57:51.5309086Z 2025-05-07T19:57:51.5310766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5312645Z int64_t error_value; 2025-05-07T19:57:51.5313094Z ^ 2025-05-07T19:57:51.5313360Z 2025-05-07T19:57:51.5314807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5316692Z int error_code = 0; 2025-05-07T19:57:51.5317137Z ^ 2025-05-07T19:57:51.5317354Z 2025-05-07T19:57:51.5318869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5320745Z int64_t error_value; 2025-05-07T19:57:51.5321444Z ^ 2025-05-07T19:57:51.5321686Z 2025-05-07T19:57:51.5323177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5325021Z int error_code = 0; 2025-05-07T19:57:51.5325493Z ^ 2025-05-07T19:57:51.5325709Z 2025-05-07T19:57:51.5327174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5329074Z int64_t error_value; 2025-05-07T19:57:51.5329552Z ^ 2025-05-07T19:57:51.5329783Z 2025-05-07T19:57:51.5331273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5333150Z int error_code = 0; 2025-05-07T19:57:51.5333585Z ^ 2025-05-07T19:57:51.5333827Z 2025-05-07T19:57:51.5334285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.5334975Z 2025-05-07T19:57:51.5336481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5338323Z int64_t error_value; 2025-05-07T19:57:51.5338805Z ^ 2025-05-07T19:57:51.5339038Z 2025-05-07T19:57:51.5340473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5342416Z int error_code = 0; 2025-05-07T19:57:51.5342856Z ^ 2025-05-07T19:57:51.5343165Z 2025-05-07T19:57:51.5344621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5346468Z int64_t error_value; 2025-05-07T19:57:51.5346885Z ^ 2025-05-07T19:57:51.5347140Z 2025-05-07T19:57:51.5348503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5350514Z int error_code = 0; 2025-05-07T19:57:51.5350922Z ^ 2025-05-07T19:57:51.5351147Z 2025-05-07T19:57:51.5352350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5353888Z int64_t error_value; 2025-05-07T19:57:51.5354312Z ^ 2025-05-07T19:57:51.5354558Z 2025-05-07T19:57:51.5355868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:51.5357663Z int error_code = 0; 2025-05-07T19:57:51.5358045Z ^ 2025-05-07T19:57:51.5358247Z 2025-05-07T19:57:51.5359622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:51.5361438Z int64_t error_value; 2025-05-07T19:57:51.5361860Z ^ 2025-05-07T19:57:51.5362106Z 2025-05-07T19:57:55.2679675Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:58.3329846Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:59.9733809Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:58:01.3697850Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:58:06.0890179Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:06.7697742Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:12.4524678Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:14.3835627Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:14.5613564Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:19.5275475Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:19.5295969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:19.5297821Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:19.5298324Z ^ 2025-05-07T19:58:19.5298757Z 2025-05-07T19:58:19.5299248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5299826Z 2025-05-07T19:58:19.5300922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:19.5302577Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:19.5303137Z ^ 2025-05-07T19:58:19.5303385Z 2025-05-07T19:58:19.5303734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5304277Z 2025-05-07T19:58:19.5305518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:19.5307043Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:19.5307545Z ^ 2025-05-07T19:58:19.5307800Z 2025-05-07T19:58:19.5308222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5308847Z 2025-05-07T19:58:19.5310218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:19.5311734Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:19.5312314Z ^ 2025-05-07T19:58:19.5312529Z 2025-05-07T19:58:19.5312932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:19.5313449Z 2025-05-07T19:58:19.7067356Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:26.5843839Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:33.0347563Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:34.0388257Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:38.4408009Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:39.2577146Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:40.4863017Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:42.9559607Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:46.2271186Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:50.2684845Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:51.3885628Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:51.3908873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3911039Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3911624Z ^ 2025-05-07T19:58:51.3911927Z 2025-05-07T19:58:51.3912384Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.3912897Z 2025-05-07T19:58:51.3914387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3916310Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3916772Z ^ 2025-05-07T19:58:51.3917060Z 2025-05-07T19:58:51.3918527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3920382Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3920938Z ^ 2025-05-07T19:58:51.3921260Z 2025-05-07T19:58:51.3922763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3924989Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3925535Z ^ 2025-05-07T19:58:51.3925774Z 2025-05-07T19:58:51.3926205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.3926989Z 2025-05-07T19:58:51.3928618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3930375Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3930880Z ^ 2025-05-07T19:58:51.3931143Z 2025-05-07T19:58:51.3932447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3934077Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3934574Z ^ 2025-05-07T19:58:51.3934871Z 2025-05-07T19:58:51.3936266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3938184Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3938740Z ^ 2025-05-07T19:58:51.3939059Z 2025-05-07T19:58:51.3939479Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.3940067Z 2025-05-07T19:58:51.3941525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3943396Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3944010Z ^ 2025-05-07T19:58:51.3944311Z 2025-05-07T19:58:51.3946025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3947928Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3948481Z ^ 2025-05-07T19:58:51.3948764Z 2025-05-07T19:58:51.3950133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3952072Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3952601Z ^ 2025-05-07T19:58:51.3952902Z 2025-05-07T19:58:51.3953320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.3953945Z 2025-05-07T19:58:51.3955404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3957315Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3957855Z ^ 2025-05-07T19:58:51.3958152Z 2025-05-07T19:58:51.3959548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3961597Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3962178Z ^ 2025-05-07T19:58:51.3962481Z 2025-05-07T19:58:51.3964114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3966337Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3969045Z ^ 2025-05-07T19:58:51.3969289Z 2025-05-07T19:58:51.3969673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:51.3970212Z 2025-05-07T19:58:51.3971476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3973186Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3973721Z ^ 2025-05-07T19:58:51.3973985Z 2025-05-07T19:58:51.3975374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:51.3977215Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:51.3977815Z ^ 2025-05-07T19:58:51.3978083Z 2025-05-07T19:58:52.9424307Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:54.6705078Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:55.1719622Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:55.5374044Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:57.4977099Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:57.5100529Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:58.0488650Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:00.7733023Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:04.6973343Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:08.2187824Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:10.1245293Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:10.8434952Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:12.3096178Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:20.0839482Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:20.6322654Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:20.9867879Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:22.4863096Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:25.0804472Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:25.1002862Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:25.3132557Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:27.5550336Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:30.9902242Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:35.3444785Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:35.3664890Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:35.9014862Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:38.3093619Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:42.7202175Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:44.1657186Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:44.1682367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1684806Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:44.1685600Z ^ 2025-05-07T19:59:44.1685885Z 2025-05-07T19:59:44.1686340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.1687064Z 2025-05-07T19:59:44.1688670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1690695Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1691188Z ^ 2025-05-07T19:59:44.1691485Z 2025-05-07T19:59:44.1693267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1695364Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1695948Z ^ 2025-05-07T19:59:44.1696241Z 2025-05-07T19:59:44.1697932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1700022Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1700594Z ^ 2025-05-07T19:59:44.1700867Z 2025-05-07T19:59:44.1702539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1704650Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:44.1705469Z ^ 2025-05-07T19:59:44.1705767Z 2025-05-07T19:59:44.1706228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.1706881Z 2025-05-07T19:59:44.1708440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1710604Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1711367Z ^ 2025-05-07T19:59:44.1711653Z 2025-05-07T19:59:44.1713229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1715404Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1715960Z ^ 2025-05-07T19:59:44.1716266Z 2025-05-07T19:59:44.1717784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1719626Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1720155Z ^ 2025-05-07T19:59:44.1720398Z 2025-05-07T19:59:44.1721958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1724134Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:44.1724958Z ^ 2025-05-07T19:59:44.1725269Z 2025-05-07T19:59:44.1725767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.1726460Z 2025-05-07T19:59:44.1728038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1730107Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1730683Z ^ 2025-05-07T19:59:44.1731006Z 2025-05-07T19:59:44.1732663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1734478Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1735000Z ^ 2025-05-07T19:59:44.1735286Z 2025-05-07T19:59:44.1736977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1738941Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1739482Z ^ 2025-05-07T19:59:44.1739759Z 2025-05-07T19:59:44.1741315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1743404Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:44.1744179Z ^ 2025-05-07T19:59:44.1744451Z 2025-05-07T19:59:44.1744942Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.1745651Z 2025-05-07T19:59:44.1747286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1749486Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1750083Z ^ 2025-05-07T19:59:44.1750413Z 2025-05-07T19:59:44.1752009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1754055Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1754683Z ^ 2025-05-07T19:59:44.1754977Z 2025-05-07T19:59:44.1756537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1758611Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1759180Z ^ 2025-05-07T19:59:44.1759448Z 2025-05-07T19:59:44.1761072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1763274Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:44.1764071Z ^ 2025-05-07T19:59:44.1764373Z 2025-05-07T19:59:44.1764852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.1765535Z 2025-05-07T19:59:44.1767091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1769105Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1769665Z ^ 2025-05-07T19:59:44.1769980Z 2025-05-07T19:59:44.1771560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1773538Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1774091Z ^ 2025-05-07T19:59:44.1774394Z 2025-05-07T19:59:44.1775937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:44.1777944Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:44.1778499Z ^ 2025-05-07T19:59:44.1778780Z 2025-05-07T19:59:47.1729620Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:52.8081264Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:52.8104738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8106879Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.8107582Z ^ 2025-05-07T19:59:52.8107963Z 2025-05-07T19:59:52.8108399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.8109221Z 2025-05-07T19:59:52.8110708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8112542Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8113084Z ^ 2025-05-07T19:59:52.8113351Z 2025-05-07T19:59:52.8114636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8116283Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8116785Z ^ 2025-05-07T19:59:52.8117024Z 2025-05-07T19:59:52.8118461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8120358Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8120911Z ^ 2025-05-07T19:59:52.8121207Z 2025-05-07T19:59:52.8122662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8124644Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.8125372Z ^ 2025-05-07T19:59:52.8125698Z 2025-05-07T19:59:52.8126114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.8126711Z 2025-05-07T19:59:52.8128238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8130028Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8130578Z ^ 2025-05-07T19:59:52.8130823Z 2025-05-07T19:59:52.8132629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8134615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8135205Z ^ 2025-05-07T19:59:52.8135496Z 2025-05-07T19:59:52.8137058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8139103Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8139694Z ^ 2025-05-07T19:59:52.8139966Z 2025-05-07T19:59:52.8141580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8143828Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.8144577Z ^ 2025-05-07T19:59:52.8144897Z 2025-05-07T19:59:52.8145363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.8146031Z 2025-05-07T19:59:52.8147448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8149288Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8149960Z ^ 2025-05-07T19:59:52.8150372Z 2025-05-07T19:59:52.8151871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8155919Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8156422Z ^ 2025-05-07T19:59:52.8156675Z 2025-05-07T19:59:52.8158082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8159928Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8160457Z ^ 2025-05-07T19:59:52.8160686Z 2025-05-07T19:59:52.8162062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8164024Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.8164732Z ^ 2025-05-07T19:59:52.8165035Z 2025-05-07T19:59:52.8165471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.8166104Z 2025-05-07T19:59:52.8167596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8169392Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8169943Z ^ 2025-05-07T19:59:52.8170205Z 2025-05-07T19:59:52.8171570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8173442Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8173988Z ^ 2025-05-07T19:59:52.8174256Z 2025-05-07T19:59:52.8175914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8177714Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8178226Z ^ 2025-05-07T19:59:52.8178503Z 2025-05-07T19:59:52.8179960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8181998Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.8182687Z ^ 2025-05-07T19:59:52.8182987Z 2025-05-07T19:59:52.8183397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.8184028Z 2025-05-07T19:59:52.8185814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8187650Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8188224Z ^ 2025-05-07T19:59:52.8188473Z 2025-05-07T19:59:52.8190001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8191753Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8192305Z ^ 2025-05-07T19:59:52.8192781Z 2025-05-07T19:59:52.8194244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.8196364Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.8196927Z ^ 2025-05-07T19:59:52.8197234Z 2025-05-07T19:59:53.9516733Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:00:00.1263662Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:06.3826712Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:44.1818480Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:52.6431022Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:54.2713924Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:57.1836845Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:59.5905597Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:59.6099434Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:59.9028597Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:01:00.6893928Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:05.7107623Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:10.3487053Z [335/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:01:10.8347514Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:11.5051772Z [337/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:01:11.8745904Z [338/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:01:14.5289482Z [339/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:16.1071289Z [340/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:20.7240253Z [341/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:21.0077425Z [342/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:21.6628024Z [343/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:22.0747633Z [344/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:22.3746496Z [345/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:23.6489379Z [346/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:24.3034547Z [347/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:25.9208351Z [348/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:30.6079724Z [349/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:31.1334972Z [350/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:31.7931737Z [351/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:32.5910073Z [352/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:34.4545530Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:36.1710140Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:37.0815431Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:38.6891153Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:45.4125187Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:47.2528816Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:49.4048674Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:55.3538055Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:02:01.1605401Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:02:01.5314195Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:02:08.9606870Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:09.5481466Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:09.7929208Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:13.4148661Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:02:13.4586733Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:15.8508746Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:20.5188570Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:30.8045015Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:32.7793839Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:33.8389943Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:47.8933725Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:49.9648551Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:58.7294834Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:03:00.2476579Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:03:02.1587596Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:05.9658470Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:07.1065127Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:03:24.5089908Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:25.7143170Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:26.8324374Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:27.2495879Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:31.8291555Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:34.4337074Z [385/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:35.1947666Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:35.4824156Z [387/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:36.6750149Z [388/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:45.7435731Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:51.3260165Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:51.4125526Z [391/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:52.2807930Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:54.0394668Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:56.6671281Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:58.0442143Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:58.9155270Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:04:00.0958850Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:00.2334488Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:00.5715767Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:01.3987063Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:04:01.9265306Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:05.9097845Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:05.9878448Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:13.3627423Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:17.8116208Z [405/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:04:20.2754202Z [406/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:20.4754528Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:20.8987675Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:21.2638943Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:23.3783271Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:23.8753817Z [411/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:23.9212156Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:23.9371991Z [413/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:04:23.9535466Z [414/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:04:24.4904784Z [415/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:04:24.9338489Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:25.0792228Z [417/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:26.0148873Z [418/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:26.7226274Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:27.2480518Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:27.6324934Z [421/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:27.9241298Z [422/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:28.0315912Z [423/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:29.4794629Z [424/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:30.4912461Z [425/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:31.1123957Z [426/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:04:31.6288642Z [427/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:32.2886545Z [428/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:32.8601505Z [429/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:33.5000467Z [430/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:33.6969441Z [431/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:33.9178955Z [432/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:34.0147375Z [433/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:34.0449932Z [434/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:34.5814935Z [435/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:34.7652077Z [436/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:34.7821960Z [437/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:34.8641821Z [438/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:35.3200514Z [439/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:35.7541914Z [440/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:36.0306589Z [441/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:36.3766487Z [442/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:36.8631493Z [443/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:36.9475224Z [444/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:37.1932377Z [445/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:37.6352348Z [446/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:37.6853465Z [447/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:37.9642008Z [448/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:39.3406447Z [449/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:39.7541382Z [450/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:40.2142622Z [451/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:40.9835492Z [452/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:41.3456696Z [453/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:41.5590297Z [454/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:42.4403547Z [455/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:42.6415564Z [456/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:44.6137553Z [457/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:44.7037602Z [458/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:46.5306386Z [459/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:47.3756583Z [460/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:49.3465795Z [461/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:49.9820855Z [462/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:50.8642179Z [463/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:50.8976616Z [464/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:50.9009734Z [465/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:52.3112843Z [466/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:53.3928894Z [467/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:53.4223100Z [468/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:54.0035951Z [469/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:54.1185160Z [470/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:55.1226531Z [471/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:55.6977448Z [472/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:55.7272952Z [473/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:56.6728306Z [474/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:56.7066810Z [475/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:56.7703754Z [476/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:56.7725601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:56.7727255Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:56.7727758Z ^ 2025-05-07T20:04:56.7728047Z 2025-05-07T20:04:56.7728439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.7729040Z 2025-05-07T20:04:56.7730358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:56.7732039Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:56.7732590Z ^ 2025-05-07T20:04:56.7732846Z 2025-05-07T20:04:56.7733283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.7733955Z 2025-05-07T20:04:56.7735203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:56.7736957Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:56.7737464Z ^ 2025-05-07T20:04:56.7737742Z 2025-05-07T20:04:56.7738146Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.7738768Z 2025-05-07T20:04:56.7740139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:56.7741834Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:56.7742390Z ^ 2025-05-07T20:04:56.7742654Z 2025-05-07T20:04:56.7743089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.7743786Z 2025-05-07T20:04:56.9110136Z [477/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:56.9956338Z [478/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:57.0196855Z [479/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:58.4590596Z [480/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:59.8985041Z [481/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:05:00.0277764Z [482/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:05:02.1147973Z [483/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:02.1172910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.1174858Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.1175465Z ^ 2025-05-07T20:05:02.1175743Z 2025-05-07T20:05:02.1176216Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.1176929Z 2025-05-07T20:05:02.1178295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.1179971Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.1180562Z ^ 2025-05-07T20:05:02.1180848Z 2025-05-07T20:05:02.1181320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.1182014Z 2025-05-07T20:05:02.1183340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.1185608Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.1186211Z ^ 2025-05-07T20:05:02.1186486Z 2025-05-07T20:05:02.1186961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.1187640Z 2025-05-07T20:05:02.1189082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:02.1190937Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:02.1193265Z ^ 2025-05-07T20:05:02.1193630Z 2025-05-07T20:05:02.1194093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:02.1194724Z 2025-05-07T20:05:03.6201954Z [484/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:05:05.7245424Z [485/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:05:05.9284691Z [486/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:05:06.7172531Z [487/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:05:07.4648485Z [488/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:05:08.8231901Z [489/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:09.0169622Z [490/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:05:11.2061343Z [491/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:05:11.2633724Z [492/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:05:11.7917530Z [493/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:05:11.9526353Z [494/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:12.9295533Z [495/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:05:13.1347688Z [496/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:05:13.4477612Z [497/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:05:13.9123638Z [498/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:05:14.2048911Z [499/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:05:14.2235540Z [500/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:05:14.5757244Z [501/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:05:15.7653670Z [502/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:05:20.3498807Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:21.4341708Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:05:24.6239766Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:25.9280078Z [506/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:05:27.9907210Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:31.2700697Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:38.4182446Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:39.2636546Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:41.4304288Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:44.6240861Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:48.7131255Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:51.3444264Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:51.6997075Z [515/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:52.4964549Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:55.8571897Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:06:02.7019644Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:06:02.7216670Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:06:13.1520370Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:06:14.8166065Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:06:15.8277236Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:06:16.2323823Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:20.9958444Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:06:23.3632646Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:23.5652114Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:25.9424627Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:06:27.4136370Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:06:30.5864858Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:30.9723144Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:32.0482896Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:33.3913849Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:06:33.6685117Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:06:33.9543525Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:33.9624141Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:06:33.9626191Z ################################################################################ 2025-05-07T20:06:33.9626779Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.9627969Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:06:33.9628779Z Removing all RPATHs ... 2025-05-07T20:06:33.9629232Z ################################################################################ 2025-05-07T20:06:33.9782186Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 1 2025-05-07T20:06:33.9784146Z ################################################################################ 2025-05-07T20:06:33.9785000Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.9785807Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:06:33.9786684Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:33.9787281Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:33.9787977Z ################################################################################ 2025-05-07T20:06:34.0488473Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:34.0490628Z ################################################################################ 2025-05-07T20:06:34.0491219Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.0492173Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:34.0493387Z Removing all RPATHs ... 2025-05-07T20:06:34.0493807Z ################################################################################ 2025-05-07T20:06:34.1193830Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:34.1266048Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:34.1268227Z ################################################################################ 2025-05-07T20:06:34.1268821Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.1269914Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:34.1270884Z Removing all RPATHs ... 2025-05-07T20:06:34.1271336Z ################################################################################ 2025-05-07T20:06:34.1810920Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:34.1813104Z ################################################################################ 2025-05-07T20:06:34.1813714Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.1814669Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:34.1815655Z Removing all RPATHs ... 2025-05-07T20:06:34.1816075Z ################################################################################ 2025-05-07T20:06:34.1913100Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:34.1915583Z ################################################################################ 2025-05-07T20:06:34.1916138Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.1917183Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:34.1918171Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:34.1918814Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:34.1919460Z ################################################################################ 2025-05-07T20:06:34.2038638Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:34.2040825Z ################################################################################ 2025-05-07T20:06:34.2041412Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.2042345Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:34.2043293Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:34.2043895Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:34.2044629Z ################################################################################ 2025-05-07T20:06:34.2344834Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:34.2347077Z ################################################################################ 2025-05-07T20:06:34.2347645Z [CMAKE] Running post-build script ... 2025-05-07T20:06:34.2348964Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:34.2350192Z Removing all RPATHs ... 2025-05-07T20:06:34.2350617Z ################################################################################ 2025-05-07T20:06:34.4673253Z [544/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:35.0609145Z [545/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:35.1885806Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:35.1888264Z ################################################################################ 2025-05-07T20:06:35.1888827Z [CMAKE] Running post-build script ... 2025-05-07T20:06:35.1889807Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:35.1890927Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:35.1891478Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:35.1892091Z ################################################################################ 2025-05-07T20:06:35.2936940Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:35.2939445Z ################################################################################ 2025-05-07T20:06:35.2940028Z [CMAKE] Running post-build script ... 2025-05-07T20:06:35.2940957Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:35.2941952Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:35.2942640Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:35.2943420Z ################################################################################ 2025-05-07T20:06:35.3377725Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:35.3379986Z ################################################################################ 2025-05-07T20:06:35.3380528Z [CMAKE] Running post-build script ... 2025-05-07T20:06:35.3381637Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:35.3382752Z Removing all RPATHs ... 2025-05-07T20:06:35.3383193Z ################################################################################ 2025-05-07T20:06:35.5971912Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:36.1681943Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:38.7607014Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:38.7954465Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:39.4610825Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:39.4680345Z [554/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:39.4682828Z ################################################################################ 2025-05-07T20:06:39.4683458Z [CMAKE] Running post-build script ... 2025-05-07T20:06:39.4684850Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:39.4686024Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:39.4686726Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:39.4687473Z ################################################################################ 2025-05-07T20:06:40.2770290Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:40.3937195Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:40.8556063Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:40.8568401Z [558/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:40.8569637Z ################################################################################ 2025-05-07T20:06:40.8569983Z [CMAKE] Running post-build script ... 2025-05-07T20:06:40.8570571Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:40.8571383Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:40.8571765Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:40.8583910Z ################################################################################ 2025-05-07T20:06:41.4897615Z [559/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:41.6222712Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:41.9479615Z [561/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:41.9480950Z ################################################################################ 2025-05-07T20:06:41.9481325Z [CMAKE] Running post-build script ... 2025-05-07T20:06:41.9481913Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:41.9482526Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:41.9482910Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:41.9483321Z ################################################################################ 2025-05-07T20:06:42.0978384Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:45.0617985Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:45.4114638Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:48.4641271Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:49.6896208Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:55.7075337Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:55.9641322Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:07:00.7771243Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:07:01.7791610Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:07:02.3966700Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:07:03.3848787Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:07:09.6686144Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:07:13.7676784Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:07:16.2031252Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:07:17.7821360Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:07:17.7833018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:17.7833903Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:17.7834230Z ^ 2025-05-07T20:07:17.7834401Z 2025-05-07T20:07:17.7834664Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7835023Z 2025-05-07T20:07:17.7835566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:17.7836366Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:17.7836702Z ^ 2025-05-07T20:07:17.7836870Z 2025-05-07T20:07:17.7837110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7837467Z 2025-05-07T20:07:17.7838050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:17.7838832Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:17.7839163Z ^ 2025-05-07T20:07:17.7839326Z 2025-05-07T20:07:17.7839563Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7839930Z 2025-05-07T20:07:17.7840469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:17.7841266Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:17.7841582Z ^ 2025-05-07T20:07:17.7841760Z 2025-05-07T20:07:17.7841996Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7842382Z 2025-05-07T20:07:20.3253849Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:07:21.2745791Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:07:31.8643316Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:07:32.3844458Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:07:33.6364719Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:33.8292986Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:34.0764066Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:39.9051819Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:44.3883748Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:44.7609623Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:45.1183936Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:46.5643743Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:47.1690419Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:48.8315380Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:49.1571508Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:49.8984231Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:52.0322381Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:52.4331334Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:53.1985184Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:07:53.4632575Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:53.4633835Z ################################################################################ 2025-05-07T20:07:53.4634205Z [CMAKE] Running post-build script ... 2025-05-07T20:07:53.4634739Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:53.4635291Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:07:53.4635655Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:53.4636083Z ################################################################################ 2025-05-07T20:09:04.5580791Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:09:07.2107908Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:09:10.3549508Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:09:12.1877055Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:09:12.8208070Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:09:12.9311207Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:09:13.0862178Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:09:13.0863604Z ################################################################################ 2025-05-07T20:09:13.0863976Z [CMAKE] Running post-build script ... 2025-05-07T20:09:13.0864639Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:13.0865303Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:13.0865681Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:13.0866098Z ################################################################################ 2025-05-07T20:09:13.4325249Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:09:13.4326641Z ################################################################################ 2025-05-07T20:09:13.4327034Z [CMAKE] Running post-build script ... 2025-05-07T20:09:13.4327796Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:13.4328561Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:13.4328976Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:13.4329474Z ################################################################################ 2025-05-07T20:09:14.4588787Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:09:14.4590320Z ################################################################################ 2025-05-07T20:09:14.4590709Z [CMAKE] Running post-build script ... 2025-05-07T20:09:14.4591380Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:14.4592050Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:14.4592446Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:14.4592870Z ################################################################################ 2025-05-07T20:09:20.7551084Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:09:23.3944492Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:09:23.3945924Z ################################################################################ 2025-05-07T20:09:23.3946306Z [CMAKE] Running post-build script ... 2025-05-07T20:09:23.3946971Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:23.3947630Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:23.3948040Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:23.3948723Z ################################################################################ 2025-05-07T20:09:23.3949889Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:09:23.4000536Z -- Install configuration: "Release" 2025-05-07T20:09:23.4003441Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:09:23.4092706Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:09:23.4093666Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:09:23.4118668Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:09:23.4121602Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:09:23.4142761Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:09:23.4168386Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:09:23.4171435Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:09:23.4174310Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:09:23.4188568Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:09:23.4191400Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:09:23.4192695Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:23.4193853Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:23.4194982Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:23.4196092Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:23.4197237Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:23.4198336Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:23.4199486Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:23.4200770Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:23.4202058Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:23.4203297Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:23.4204518Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:23.4205802Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:23.4207184Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:23.4208588Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:23.4209896Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:23.4211222Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:09:23.4212626Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:23.4213913Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:23.4215115Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:23.4216369Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:23.4217743Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:23.4219057Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:23.4220201Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:23.4223150Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:23.4268121Z 2025-05-07T20:09:23.4311566Z 2025-05-07T20:09:23.4312298Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:23.4314200Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:23.4315125Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:23.4315892Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:23.4317103Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:23.4318341Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:23.4319419Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:23.4320294Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:23.4321167Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:23.4322011Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:23.4322910Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:23.4324055Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:23.4325327Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:23.4326353Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:23.4327689Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:23.4328960Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:23.4330299Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:23.4331672Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:23.4333105Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:23.4334449Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:23.4335565Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:23.4336402Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:23.4337040Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:23.4337793Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:23.4338749Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:23.4339536Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:23.4340291Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:23.4341111Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:23.4341945Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:23.4342850Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:23.4343918Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:23.4345089Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:23.4346124Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:23.4347037Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:23.4347905Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:23.4348624Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:23.4349487Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:23.4350451Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:23.4351296Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:23.4351998Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:23.4352679Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:23.4353416Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:23.4354127Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:23.4354862Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:23.4355711Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:23.4356564Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:23.4357491Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:23.4358266Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:23.4358986Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:23.4359854Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:23.4360697Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:23.4361594Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:23.4362370Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:23.4363117Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:23.4363990Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:23.4364774Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:23.4365566Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:23.4366446Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:23.4367224Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4368014Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:23.4368918Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:23.4370071Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:23.4371420Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:23.4372648Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:23.4373835Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:23.4375184Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:23.4376695Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:23.4378251Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:23.4379682Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:23.4381154Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:23.4382524Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:23.4383852Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:23.4385139Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4385928Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:23.4386879Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:23.4387870Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:23.4388835Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:23.4389979Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:23.4391163Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:23.4392250Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:23.4393318Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:23.4394427Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:23.4395618Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:23.4396695Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:23.4397453Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:23.4398241Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:23.4399325Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:23.4400251Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:23.4401007Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:23.4401880Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:23.4402761Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:23.4403679Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:23.4404491Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4405275Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:23.4406204Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:23.4407144Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:23.4408080Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:23.4409010Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:23.4409812Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:23.4410583Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:23.4411714Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:23.4412607Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:23.4413399Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:23.4414560Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:23.4415570Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:23.4416363Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:23.4417464Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:23.4418137Z 2025-05-07T20:09:23.4491191Z INFO:root:running bdist_wheel 2025-05-07T20:09:23.4526144Z INFO:root:running build 2025-05-07T20:09:23.4527010Z INFO:root:running build_py 2025-05-07T20:09:23.4529475Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4532488Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4535129Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4536466Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4537750Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4539140Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4540797Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4542219Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4543553Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4544975Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4546306Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4548120Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4549794Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4551633Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4553138Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4554602Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4556138Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4557713Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4559298Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4562201Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4563820Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4565278Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4566594Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4567753Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:23.4568900Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:23.4570479Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:23.4572580Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4573691Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4575133Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4576548Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4577968Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4579495Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4581016Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4582490Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4583909Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4585531Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:23.4587678Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:23.4588914Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:23.4590447Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:23.4592123Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:09:23.4593396Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:09:23.4595246Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:09:23.4596437Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:09:23.4598227Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:23.4599353Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:23.4600967Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:23.4602395Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:23.4603854Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:23.4605962Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:23.4607076Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:23.4608614Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:23.4610066Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:23.4611533Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:23.4613065Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:23.4614298Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:23.4615747Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:23.4617852Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:23.4618984Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:23.4620523Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:23.4622876Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4624140Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4625624Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4627274Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4629008Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4630690Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4632279Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4633975Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4635749Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4637502Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4639227Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4640982Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4642715Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4644584Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:23.4645892Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4647044Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4648510Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4649997Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4651468Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4652982Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4654570Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4656109Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4657607Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4659146Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4660741Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4662262Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:23.4663414Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:23.4664559Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:23.4666068Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:23.4667290Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:23.4668419Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:23.4669894Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:23.4671361Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:23.4672974Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:23.4675615Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4676869Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4678322Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4679795Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4681270Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4682732Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:23.4683985Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:23.4685359Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:23.4687042Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:23.4688611Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:23.4689830Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:23.4691454Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:23.4692936Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:23.4694158Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:23.4695751Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:23.4739023Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.4783810Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.5114064Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:23.6166336Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.0634613Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.0649833Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.1938767Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.2056646Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.2269605Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:27.2964112Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:30.0342023Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:30.0906201Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:37.3409559Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:38.6557000Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:41.2822093Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:41.7475670Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:41.7769078Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.0475337Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0477090Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0492917Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0500862Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0511607Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0519432Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0537246Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0543499Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0555289Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0561833Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0574840Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0580048Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0591704Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0598066Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0607761Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:42.0613268Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:42.0615454Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:42.0624464Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:42.0634139Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.0665398Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6385914Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6387653Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6389165Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6390555Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6392056Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6393685Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6395233Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6396659Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6398055Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6399479Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6401022Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6402748Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6404370Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6405886Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6407460Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6409052Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6410703Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6412295Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6414030Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6415799Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6417381Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6418795Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:09:42.6420215Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:42.6421726Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:09:42.6423237Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6424708Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6426188Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6427714Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6429373Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6431578Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6433309Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6434863Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6436469Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:09:42.6438937Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:42.6442145Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:09:42.6443842Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:09:42.6445271Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:09:42.6447093Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:42.6448525Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:42.6450106Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:42.6451610Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:09:42.6453011Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:42.6454448Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:42.6455887Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:42.6457401Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:09:42.6458972Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:42.6460477Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:09:42.6462738Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:42.6464295Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:09:42.6465930Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6467563Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6469259Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6470965Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6472590Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6474180Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6475866Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6477631Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6479375Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6481145Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6482932Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6484785Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6486470Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6488085Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6489580Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6491065Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6492539Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6494082Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6495835Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6499105Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6500612Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6502385Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6504132Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6505784Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6507357Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:42.6509007Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:09:42.6510801Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6512437Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6514039Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6515553Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6517102Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6518688Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6520283Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6521889Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6523617Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6525175Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:42.6526789Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:09:42.6528481Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:42.6530305Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:42.6532069Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:42.6533717Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:09:42.6551128Z INFO:skbuild:copied 90 files 2025-05-07T20:09:42.6552106Z INFO:root:running build_ext 2025-05-07T20:09:42.6552903Z INFO:root:installing to _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:42.6553421Z INFO:root:running install 2025-05-07T20:09:42.6610046Z INFO:root:running install_lib 2025-05-07T20:09:42.6611771Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:42.6614082Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:42.6615657Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:42.6616973Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:42.6618830Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:42.6620381Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:42.6621783Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6623681Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6625360Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6627182Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6628975Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6630985Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6632844Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6634531Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6636330Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:42.6637696Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:42.6639113Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:42.6640873Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:42.6642294Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:42.6643117Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:42.6644422Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:42.6646173Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:42.6647534Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:42.6648901Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:42.6650725Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:42.6652029Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6653405Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6655211Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6657127Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6659210Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6661174Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6663173Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6665226Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6667472Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6669716Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6671818Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6673929Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6676039Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6678108Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:42.6680012Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:42.6681158Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:42.6682067Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6683445Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6685524Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6687253Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6689080Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6690996Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6692937Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6694823Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6696581Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6698542Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6700592Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6702467Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:42.6703742Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:42.6705129Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:42.6707018Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:42.6708438Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6709321Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:42.6710607Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:42.6712488Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:42.6714392Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6716053Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6717679Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6719515Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:42.6720738Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6721948Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6723593Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6725342Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6726957Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6728624Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:42.6729862Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:42.6731042Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:42.6732708Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:42.6734317Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:42.6735427Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:42.6736227Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:09:42.6737487Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:42.6739222Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:42.6753802Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:42.6755580Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:42.6757263Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:42.6758905Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:42.6760109Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:42.6761281Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:42.6762961Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:42.6764510Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:42.6766121Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:42.6767559Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.6768962Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.6770399Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.6880669Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.9572259Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.9573889Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.9678740Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.9687120Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.9711631Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:42.9768832Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:43.1905070Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:43.1951622Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:43.7552189Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:43.8423993Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.0430056Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.0788542Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.0814844Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1026351Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1031654Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1035164Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1037409Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1039607Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1041805Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1044017Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1046304Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1048643Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1050982Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1053422Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1055720Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1057902Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1060020Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1062153Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:44.1063733Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:44.1065480Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:44.1067658Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:44.1069799Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1071423Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1504994Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1506610Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1508176Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1509774Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1511552Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1513364Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1515027Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1516547Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1518091Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1519623Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1521170Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1522844Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1524556Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1526251Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1527962Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1529693Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1531450Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1533334Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1535102Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1536832Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1538452Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1539933Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:44.1540833Z INFO:skbuild:copied 125 files 2025-05-07T20:09:44.1541147Z INFO:root:running install_egg_info 2025-05-07T20:09:44.1569148Z INFO:root:running egg_info 2025-05-07T20:09:44.1592863Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:44.1596607Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:44.1598812Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:44.1599558Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:44.1683091Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:44.1715016Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:44.1716178Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.11.egg-info 2025-05-07T20:09:44.1724176Z INFO:root:running install_scripts 2025-05-07T20:09:44.1724537Z INFO:skbuild:copied 0 files 2025-05-07T20:09:46.8624348Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:46.8627052Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-w8ozr4b7/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:46.8628132Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:46.8903759Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:46.8914229Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:46.8915490Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:47.0527610Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:47.0658346Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:47.0795089Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:48.8498161Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:49.0551806Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:49.7753660Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:49.8880246Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:50.4854626Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:10:08.6521858Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:10:09.9187833Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:38.1733017Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:41.0066666Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:44.6716901Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:45.2637121Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:45.4385679Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:54.2054663Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:11:05.3817746Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:11:06.8651172Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:11:06.9010463Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:11:06.9013728Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:11:06.9014356Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:11:06.9014866Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:11:06.9017011Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:11:06.9019928Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:11:06.9032068Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:11:06.9034842Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:11:06.9037541Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:11:06.9039086Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:11:06.9040580Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:11:06.9042258Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:11:06.9046004Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:11:06.9070182Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:11:06.9111906Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:11:06.9114417Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:11:06.9115770Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:11:06.9117851Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:11:06.9119344Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:11:06.9121597Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:11:06.9122909Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:11:06.9124669Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:11:06.9129528Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:11:06.9130624Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:11:06.9131783Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:11:06.9133031Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:11:06.9133596Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:11:06.9135252Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:11:06.9141306Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:11:06.9142795Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:11:06.9144862Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:11:06.9146104Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:11:06.9148000Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:11:06.9150177Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:11:06.9156181Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:11:06.9158740Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:11:06.9161110Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:11:06.9163535Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:11:06.9165405Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:11:06.9167051Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:11:06.9168759Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:11:06.9172349Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:11:06.9177018Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:11:06.9178810Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:11:06.9180955Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:11:06.9186941Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:11:06.9192053Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:11:06.9193739Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:11:06.9198100Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:11:06.9203324Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:11:06.9206708Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:11:06.9208912Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:11:06.9211971Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:11:06.9214321Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:11:06.9215268Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:11:06.9218259Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:11:06.9222044Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:11:06.9224627Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:11:06.9228784Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:11:06.9231776Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:11:06.9234087Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:11:06.9237247Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:11:06.9241449Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:11:06.9243727Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:11:06.9245071Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:11:06.9247621Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:11:06.9249044Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:11:06.9251520Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:11:06.9253139Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:11:06.9258311Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:11:06.9260223Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:11:06.9262666Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:11:06.9264137Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:11:06.9265452Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:11:06.9268492Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:11:06.9271265Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:11:06.9273671Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:11:06.9275289Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:11:06.9276801Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:11:06.9278327Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:11:06.9279832Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:11:06.9281059Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:11:06.9287690Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:11:06.9312710Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:11:06.9315121Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:11:06.9318061Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:11:06.9319959Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:11:06.9322197Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:11:06.9323726Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:11:06.9324971Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:11:06.9326498Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:11:06.9328982Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:11:06.9334554Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:11:06.9336436Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:11:06.9338058Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:11:06.9345452Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:11:06.9350604Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:11:06.9352428Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:11:06.9360323Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:11:06.9362062Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:11:06.9364022Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:11:06.9365530Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:11:06.9367589Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:11:06.9370188Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:11:06.9371014Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:11:06.9371948Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:11:06.9379197Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:11:06.9382241Z INFO:root:removing _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:11:07.0911188Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:11:07.0911756Z │ │ Version │ 2025-05-07T20:11:07.0912312Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:11:07.0912829Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:11:07.0913397Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:07.0913974Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:11:07.0915033Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:07.0915603Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:11:07.0916221Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:11:07.0916733Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:11:07.0917235Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:11:07.0917728Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:11:07.0918280Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:11:07.3947185Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:07.4853072Z 2025-05-07T20:11:07.5005413Z ################################################################################ 2025-05-07T20:11:07.5006023Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5006466Z [CHECK] Listing out library size: 2025-05-07T20:11:07.5006895Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5007450Z 2025-05-07T20:11:07.5014168Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5015128Z 2025-05-07T20:11:07.5019681Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5020592Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.5021141Z 2025-05-07T20:11:07.5085918Z GLIBC_2.2.5 2025-05-07T20:11:07.5086172Z GLIBC_2.14 2025-05-07T20:11:07.5086297Z 2025-05-07T20:11:07.5086307Z 2025-05-07T20:11:07.5086736Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5087861Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.5088449Z 2025-05-07T20:11:07.5151111Z 2025-05-07T20:11:07.5151129Z 2025-05-07T20:11:07.5178626Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so > /tmp/tmp.ke9oFS59lh.symbols.txt 2025-05-07T20:11:07.5179912Z 2025-05-07T20:11:07.5217099Z 2025-05-07T20:11:07.5245846Z [CHECK] Total Number of symbols: 803 2025-05-07T20:11:07.5264367Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:11:07.5289440Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so > /tmp/tmp.8NJ1UW6hgq.usymbols.txt 2025-05-07T20:11:07.5290741Z 2025-05-07T20:11:07.5308257Z 2025-05-07T20:11:07.5339736Z [CHECK] Listing out undefined symbols (49 total): 2025-05-07T20:11:07.5355294Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.5356450Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.5357427Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.5358357Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:07.5359307Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.5360235Z U __popcountdi2@GCC_3.4 2025-05-07T20:11:07.5361117Z U abort@GLIBC_2.2.5 2025-05-07T20:11:07.5361395Z U close@GLIBC_2.2.5 2025-05-07T20:11:07.5361687Z U fputs@GLIBC_2.2.5 2025-05-07T20:11:07.5361975Z U free@GLIBC_2.2.5 2025-05-07T20:11:07.5362264Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:11:07.5362581Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:07.5362870Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:07.5363176Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:11:07.5363486Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:07.5363786Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:07.5364070Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:07.5364365Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.5364793Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.5365086Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.5365389Z U mmap@GLIBC_2.2.5 2025-05-07T20:11:07.5365735Z U mprotect@GLIBC_2.2.5 2025-05-07T20:11:07.5366043Z U munmap@GLIBC_2.2.5 2025-05-07T20:11:07.5366328Z U open64@GLIBC_2.2.5 2025-05-07T20:11:07.5366679Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.5367088Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:11:07.5367428Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:07.5367772Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:07.5368104Z U read@GLIBC_2.2.5 2025-05-07T20:11:07.5368384Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:07.5368683Z U shm_open 2025-05-07T20:11:07.5368930Z U shm_unlink 2025-05-07T20:11:07.5369209Z U snprintf@GLIBC_2.2.5 2025-05-07T20:11:07.5369499Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:07.5369796Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:07.5370126Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.5370420Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:07.5370721Z U syscall@GLIBC_2.2.5 2025-05-07T20:11:07.5371111Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:07.5371403Z U uname@GLIBC_2.2.5 2025-05-07T20:11:07.5371674Z U unlink@GLIBC_2.2.5 2025-05-07T20:11:07.5371967Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:11:07.5372332Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.5372766Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.5373206Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.5373728Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.5374045Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.5374335Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.5374628Z w __gmon_start__ 2025-05-07T20:11:07.5374939Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.5375330Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5375569Z 2025-05-07T20:11:07.5406091Z linux-vdso.so.1 (0x00007ffeca741000) 2025-05-07T20:11:07.5406539Z libtorch_cpu.so => not found 2025-05-07T20:11:07.5406875Z libtorch_cuda.so => not found 2025-05-07T20:11:07.5407159Z libtorch.so => not found 2025-05-07T20:11:07.5407602Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f737182e000) 2025-05-07T20:11:07.5408032Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7371800000) 2025-05-07T20:11:07.5408436Z libc.so.6 => /lib64/libc.so.6 (0x00007f73715f6000) 2025-05-07T20:11:07.5408827Z libm.so.6 => /lib64/libm.so.6 (0x00007f737151b000) 2025-05-07T20:11:07.5409213Z /lib64/ld-linux-x86-64.so.2 (0x00007f7371b11000) 2025-05-07T20:11:07.5409455Z 2025-05-07T20:11:07.5409568Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.5409972Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:11:07.5410267Z 2025-05-07T20:11:07.5445897Z 2025-05-07T20:11:07.5446276Z Dynamic section at offset 0x78e78 contains 33 entries: 2025-05-07T20:11:07.5446709Z Tag Type Name/Value 2025-05-07T20:11:07.5447216Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.5447783Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.5448329Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.5448849Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.5449398Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.5450046Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.5450553Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:11:07.5450999Z 0x000000000000000c (INIT) 0x1a000 2025-05-07T20:11:07.5451385Z 0x000000000000000d (FINI) 0x5af2c 2025-05-07T20:11:07.5451737Z 0x0000000000000019 (INIT_ARRAY) 0x780a0 2025-05-07T20:11:07.5452081Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.5452443Z 0x000000000000001a (FINI_ARRAY) 0x780a8 2025-05-07T20:11:07.5452803Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.5453140Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:07.5453486Z 0x000000006ffffef5 (GNU_HASH) 0x1e18 2025-05-07T20:11:07.5453820Z 0x0000000000000005 (STRTAB) 0x86e0 2025-05-07T20:11:07.5454161Z 0x0000000000000006 (SYMTAB) 0x3b80 2025-05-07T20:11:07.5454515Z 0x000000000000000a (STRSZ) 45342 (bytes) 2025-05-07T20:11:07.5454908Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.5455263Z 0x0000000000000003 (PLTGOT) 0x790d8 2025-05-07T20:11:07.5455660Z 0x0000000000000002 (PLTRELSZ) 8064 (bytes) 2025-05-07T20:11:07.5456021Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.5456340Z 0x0000000000000017 (JMPREL) 0x17220 2025-05-07T20:11:07.5456678Z 0x0000000000000007 (RELA) 0x13ed8 2025-05-07T20:11:07.5457025Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:11:07.5457399Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.5457724Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.5458063Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.5458428Z 0x000000006ffffffe (VERNEED) 0x13e48 2025-05-07T20:11:07.5458799Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:07.5459138Z 0x000000006ffffff0 (VERSYM) 0x137fe 2025-05-07T20:11:07.5459468Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:11:07.5459789Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.5460000Z 2025-05-07T20:11:07.5460117Z ################################################################################ 2025-05-07T20:11:07.5460366Z 2025-05-07T20:11:07.5460370Z 2025-05-07T20:11:07.5460590Z ################################################################################ 2025-05-07T20:11:07.5461076Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5461632Z [CHECK] Listing out library size: 2025-05-07T20:11:07.5462052Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5462387Z 2025-05-07T20:11:07.5471010Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5471757Z 2025-05-07T20:11:07.5472458Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5473453Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.5474044Z 2025-05-07T20:11:07.5524811Z GLIBC_2.2.5 2025-05-07T20:11:07.5525146Z GLIBC_2.14 2025-05-07T20:11:07.5525354Z 2025-05-07T20:11:07.5525362Z 2025-05-07T20:11:07.5526062Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5527123Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.5527745Z 2025-05-07T20:11:07.5574835Z GLIBCXX_3.4 2025-05-07T20:11:07.5575149Z GLIBCXX_3.4.9 2025-05-07T20:11:07.5575403Z GLIBCXX_3.4.21 2025-05-07T20:11:07.5575643Z 2025-05-07T20:11:07.5575661Z 2025-05-07T20:11:07.5593872Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.WShTsMLVcx.symbols.txt 2025-05-07T20:11:07.5594473Z 2025-05-07T20:11:07.5624044Z 2025-05-07T20:11:07.5657260Z [CHECK] Total Number of symbols: 107 2025-05-07T20:11:07.5671557Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:07.5688863Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.qTZlhvmE20.usymbols.txt 2025-05-07T20:11:07.5689343Z 2025-05-07T20:11:07.5708578Z 2025-05-07T20:11:07.5736522Z [CHECK] Listing out undefined symbols (57 total): 2025-05-07T20:11:07.5755153Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.5755776Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.5756088Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.5756421Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.5756753Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.5757081Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.5757423Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.5757741Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:07.5758197Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:07.5758526Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.5758865Z U c10::BoolType::get() 2025-05-07T20:11:07.5759165Z U c10::StringType::get() 2025-05-07T20:11:07.5759510Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:07.5760301Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:07.5761567Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.5762446Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:07.5762740Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:07.5763041Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.5763345Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.5763637Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.5764097Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.5764486Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.5764901Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:07.5765664Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:07.5766523Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:07.5767133Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.5767492Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.5767893Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.5768285Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.5768656Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.5769234Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:07.5770229Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.5771016Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.5771363Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.5771691Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.5772070Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.5772391Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:07.5772775Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:07.5773092Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.5773364Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:07.5773678Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:07.5774470Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:07.5775656Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:07.5776645Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:07.5777270Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:11:07.5777694Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.5778099Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:07.5778527Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.5779138Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.5779899Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.5780530Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:07.5781084Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:07.5781518Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.5781843Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.5782142Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.5782439Z w __gmon_start__ 2025-05-07T20:11:07.5782700Z w __pthread_key_create 2025-05-07T20:11:07.5783046Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.5783474Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5783758Z 2025-05-07T20:11:07.5806399Z linux-vdso.so.1 (0x00007ffde926b000) 2025-05-07T20:11:07.5806888Z libc10.so => not found 2025-05-07T20:11:07.5807261Z libtorch_cpu.so => not found 2025-05-07T20:11:07.5807573Z libtorch_cuda.so => not found 2025-05-07T20:11:07.5807843Z libtorch.so => not found 2025-05-07T20:11:07.5808195Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa02ece2000) 2025-05-07T20:11:07.5808642Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa02ecb2000) 2025-05-07T20:11:07.5809036Z libc.so.6 => /lib64/libc.so.6 (0x00007fa02eaaa000) 2025-05-07T20:11:07.5809410Z libm.so.6 => /lib64/libm.so.6 (0x00007fa02e9cf000) 2025-05-07T20:11:07.5809783Z /lib64/ld-linux-x86-64.so.2 (0x00007fa02ef56000) 2025-05-07T20:11:07.5810044Z 2025-05-07T20:11:07.5810158Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.5810579Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:07.5810925Z 2025-05-07T20:11:07.5842685Z 2025-05-07T20:11:07.5843044Z Dynamic section at offset 0xab00 contains 34 entries: 2025-05-07T20:11:07.5843458Z Tag Type Name/Value 2025-05-07T20:11:07.5843912Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.5844507Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.5845066Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.5845715Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.5846251Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.5846824Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.5847344Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.5847883Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:11:07.5848331Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:11:07.5848669Z 0x000000000000000d (FINI) 0x817c 2025-05-07T20:11:07.5848997Z 0x0000000000000019 (INIT_ARRAY) 0xaa58 2025-05-07T20:11:07.5849350Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:11:07.5849794Z 0x000000000000001a (FINI_ARRAY) 0xaa68 2025-05-07T20:11:07.5850127Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.5850437Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:07.5850753Z 0x000000006ffffef5 (GNU_HASH) 0x700 2025-05-07T20:11:07.5851073Z 0x0000000000000005 (STRTAB) 0x13b0 2025-05-07T20:11:07.5851417Z 0x0000000000000006 (SYMTAB) 0x990 2025-05-07T20:11:07.5851754Z 0x000000000000000a (STRSZ) 6890 (bytes) 2025-05-07T20:11:07.5852087Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.5852423Z 0x0000000000000003 (PLTGOT) 0xad70 2025-05-07T20:11:07.5852746Z 0x0000000000000002 (PLTRELSZ) 1272 (bytes) 2025-05-07T20:11:07.5853078Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.5853376Z 0x0000000000000017 (JMPREL) 0x34a8 2025-05-07T20:11:07.5853687Z 0x0000000000000007 (RELA) 0x3028 2025-05-07T20:11:07.5854015Z 0x0000000000000008 (RELASZ) 1152 (bytes) 2025-05-07T20:11:07.5854381Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.5854696Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.5854999Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.5855338Z 0x000000006ffffffe (VERNEED) 0x2f78 2025-05-07T20:11:07.5855648Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:07.5855961Z 0x000000006ffffff0 (VERSYM) 0x2e9a 2025-05-07T20:11:07.5856422Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:11:07.5856733Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.5856935Z 2025-05-07T20:11:07.5857059Z ################################################################################ 2025-05-07T20:11:07.5857282Z 2025-05-07T20:11:07.5857286Z 2025-05-07T20:11:07.5857397Z ################################################################################ 2025-05-07T20:11:07.5857838Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.5858427Z [CHECK] Listing out library size: 2025-05-07T20:11:07.5858838Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.5859157Z 2025-05-07T20:11:07.5859325Z 6 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.5859571Z 2025-05-07T20:11:07.5859903Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.5860796Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.5861345Z 2025-05-07T20:11:07.6129020Z GLIBC_2.2.5 2025-05-07T20:11:07.6129648Z GLIBC_2.3 2025-05-07T20:11:07.6130245Z GLIBC_2.14 2025-05-07T20:11:07.6130568Z 2025-05-07T20:11:07.6130581Z 2025-05-07T20:11:07.6131624Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.6134398Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.6135063Z 2025-05-07T20:11:07.6388678Z GLIBCXX_3.4 2025-05-07T20:11:07.6389552Z GLIBCXX_3.4.9 2025-05-07T20:11:07.6390167Z GLIBCXX_3.4.11 2025-05-07T20:11:07.6390748Z GLIBCXX_3.4.14 2025-05-07T20:11:07.6391348Z GLIBCXX_3.4.15 2025-05-07T20:11:07.6392055Z GLIBCXX_3.4.18 2025-05-07T20:11:07.6392648Z GLIBCXX_3.4.21 2025-05-07T20:11:07.6392995Z 2025-05-07T20:11:07.6393009Z 2025-05-07T20:11:07.6409782Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so > /tmp/tmp.eF1w0LKpDL.symbols.txt 2025-05-07T20:11:07.6410238Z 2025-05-07T20:11:07.6628380Z 2025-05-07T20:11:07.6653978Z [CHECK] Total Number of symbols: 4871 2025-05-07T20:11:07.6671063Z [CHECK] Number of fbgemm symbols: 3365 2025-05-07T20:11:07.6687195Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so > /tmp/tmp.u65KWnSD26.usymbols.txt 2025-05-07T20:11:07.6688469Z 2025-05-07T20:11:07.6711985Z 2025-05-07T20:11:07.6739218Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:11:07.6751087Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.6751567Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:07.6751937Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.6752656Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.6752987Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.6753328Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:07.6753678Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:07.6754001Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.6754348Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.6754699Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:11:07.6755071Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:07.6755388Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:07.6755728Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:07.6756146Z U __cxa_throw_bad_array_new_length@CXXABI_1.3.8 2025-05-07T20:11:07.6756529Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.6756877Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:07.6757195Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:07.6757520Z U abort@GLIBC_2.2.5 2025-05-07T20:11:07.6757938Z U asmjit::_abi_1_13::BaseAssembler::bind(asmjit::_abi_1_13::Label const&) 2025-05-07T20:11:07.6758431Z U asmjit::_abi_1_13::BaseAssembler::newLabel() 2025-05-07T20:11:07.6758959Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:07.6759753Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:07.6760777Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:07.6762021Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:07.6763225Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:11:07.6764060Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:07.6764775Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:07.6765402Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:11:07.6766059Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:11:07.6766568Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:11:07.6767172Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:11:07.6767997Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:11:07.6768590Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:11:07.6769039Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:11:07.6769656Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:11:07.6770290Z U asmjit::_abi_1_13::JitRuntime::_add(void**, asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:07.6770766Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:11:07.6771226Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:07.6771708Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:11:07.6772057Z U cpuinfo_get_packages 2025-05-07T20:11:07.6772413Z U cpuinfo_get_packages_count 2025-05-07T20:11:07.6772728Z U cpuinfo_initialize 2025-05-07T20:11:07.6773024Z U cpuinfo_isa 2025-05-07T20:11:07.6773299Z U fma@GLIBC_2.2.5 2025-05-07T20:11:07.6773566Z U fmaf@GLIBC_2.2.5 2025-05-07T20:11:07.6773854Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:07.6774126Z U free@GLIBC_2.2.5 2025-05-07T20:11:07.6774414Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:07.6774695Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:07.6774983Z U log2@GLIBC_2.2.5 2025-05-07T20:11:07.6775254Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:07.6775573Z U lrintf@GLIBC_2.2.5 2025-05-07T20:11:07.6775867Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:07.6776146Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.6776438Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.6776724Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.6777021Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:11:07.6777320Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:11:07.6777681Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.6778058Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:11:07.6778415Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.6778785Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.6779123Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:11:07.6779442Z U pow@GLIBC_2.2.5 2025-05-07T20:11:07.6779710Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:07.6780136Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:07.6780633Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:07.6781113Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:07.6781801Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:07.6782555Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:11:07.6783647Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:11:07.6785023Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:07.6786025Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:07.6786547Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:07.6787010Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:07.6787492Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:11:07.6788028Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:07.6788517Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:07.6788929Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:07.6789385Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:07.6789741Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.6790099Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:07.6790431Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:07.6790811Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:07.6791208Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:11:07.6791649Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.6792064Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.6792455Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:07.6792846Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.6793708Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.6794524Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:11:07.6794930Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:07.6795292Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:07.6795702Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:11:07.6796094Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:11:07.6796477Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.6796851Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.6797514Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:07.6798274Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:11:07.6798815Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:07.6799319Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.6799887Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.6800365Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:07.6800736Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:07.6801117Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:07.6801576Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:07.6802126Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:07.6802569Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:07.6802964Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:07.6803285Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:07.6803604Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:07.6803914Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.6804208Z U strstr@GLIBC_2.2.5 2025-05-07T20:11:07.6804519Z U tolower@GLIBC_2.2.5 2025-05-07T20:11:07.6804837Z U toupper@GLIBC_2.2.5 2025-05-07T20:11:07.6805241Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:11:07.6805726Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:07.6806124Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:07.6806527Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:07.6806928Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.6807377Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.6807781Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:07.6808165Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:07.6808527Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.6808867Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.6809198Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.6809605Z w __gmon_start__ 2025-05-07T20:11:07.6809875Z w __pthread_key_create 2025-05-07T20:11:07.6810187Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:07.6810510Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:07.6810812Z w pthread_once 2025-05-07T20:11:07.6811079Z w pthread_rwlock_rdlock 2025-05-07T20:11:07.6811355Z w pthread_rwlock_unlock 2025-05-07T20:11:07.6811645Z w pthread_rwlock_wrlock 2025-05-07T20:11:07.6811941Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:11:07.6812272Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.6812664Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.6812903Z 2025-05-07T20:11:07.6813059Z linux-vdso.so.1 (0x00007ffc4d3ca000) 2025-05-07T20:11:07.6813350Z libc10.so => not found 2025-05-07T20:11:07.6813843Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007fad456ad000) 2025-05-07T20:11:07.6814397Z libtorch.so => not found 2025-05-07T20:11:07.6814658Z libtorch_cpu.so => not found 2025-05-07T20:11:07.6814919Z libtorch_cuda.so => not found 2025-05-07T20:11:07.6815249Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fad44d9c000) 2025-05-07T20:11:07.6815620Z libm.so.6 => /lib64/libm.so.6 (0x00007fad455d0000) 2025-05-07T20:11:07.6815992Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fad455a2000) 2025-05-07T20:11:07.6816355Z libc.so.6 => /lib64/libc.so.6 (0x00007fad44b94000) 2025-05-07T20:11:07.6816707Z /lib64/ld-linux-x86-64.so.2 (0x00007fad4572c000) 2025-05-07T20:11:07.6817033Z libtorch_cpu.so => not found 2025-05-07T20:11:07.6817286Z libtorch_cuda.so => not found 2025-05-07T20:11:07.6817550Z libtorch.so => not found 2025-05-07T20:11:07.6817705Z 2025-05-07T20:11:07.6817810Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.6818179Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:11:07.6818447Z 2025-05-07T20:11:07.6831672Z 2025-05-07T20:11:07.6832116Z Dynamic section at offset 0x51fb38 contains 38 entries: 2025-05-07T20:11:07.6832625Z Tag Type Name/Value 2025-05-07T20:11:07.6833070Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.6833704Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:11:07.6834274Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.6834791Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.6835361Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.6835906Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.6836423Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:07.6837004Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.6837513Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.6838086Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:07.6838634Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:11:07.6839133Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:07.6839554Z 0x000000000000000c (INIT) 0xf6000 2025-05-07T20:11:07.6839890Z 0x000000000000000d (FINI) 0x4c8fb0 2025-05-07T20:11:07.6840246Z 0x0000000000000019 (INIT_ARRAY) 0x51dac0 2025-05-07T20:11:07.6840595Z 0x000000000000001b (INIT_ARRAYSZ) 56 (bytes) 2025-05-07T20:11:07.6840960Z 0x000000000000001a (FINI_ARRAY) 0x51daf8 2025-05-07T20:11:07.6841302Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.6841654Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:07.6841997Z 0x000000006ffffef5 (GNU_HASH) 0x6e20 2025-05-07T20:11:07.6842335Z 0x0000000000000005 (STRTAB) 0x2b0a0 2025-05-07T20:11:07.6842708Z 0x0000000000000006 (SYMTAB) 0xe7e0 2025-05-07T20:11:07.6843060Z 0x000000000000000a (STRSZ) 708057 (bytes) 2025-05-07T20:11:07.6843436Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.6843781Z 0x0000000000000003 (PLTGOT) 0x520dd8 2025-05-07T20:11:07.6844154Z 0x0000000000000002 (PLTRELSZ) 24312 (bytes) 2025-05-07T20:11:07.6844504Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.6844845Z 0x0000000000000017 (JMPREL) 0xef8e0 2025-05-07T20:11:07.6845189Z 0x0000000000000007 (RELA) 0xda610 2025-05-07T20:11:07.6845542Z 0x0000000000000008 (RELASZ) 86736 (bytes) 2025-05-07T20:11:07.6846098Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.6846452Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.6846792Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.6847138Z 0x000000006ffffffe (VERNEED) 0xda490 2025-05-07T20:11:07.6847489Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:07.6847997Z 0x000000006ffffff0 (VERSYM) 0xd7e7a 2025-05-07T20:11:07.6848342Z 0x000000006ffffff9 (RELACOUNT) 9 2025-05-07T20:11:07.6848888Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.6849306Z 2025-05-07T20:11:07.6849495Z ################################################################################ 2025-05-07T20:11:07.6849751Z 2025-05-07T20:11:07.6849755Z 2025-05-07T20:11:07.6849868Z ################################################################################ 2025-05-07T20:11:07.6850369Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.6850876Z [CHECK] Listing out library size: 2025-05-07T20:11:07.6851347Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.6851726Z 2025-05-07T20:11:07.6851936Z 3 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.6852265Z 2025-05-07T20:11:07.6852661Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.6853679Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.6854282Z 2025-05-07T20:11:07.6907383Z GLIBC_2.2.5 2025-05-07T20:11:07.6908211Z GLIBC_2.3 2025-05-07T20:11:07.6909395Z GLIBC_2.14 2025-05-07T20:11:07.6910054Z 2025-05-07T20:11:07.6910080Z 2025-05-07T20:11:07.6911541Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.6914402Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.6915041Z 2025-05-07T20:11:07.6967530Z GLIBCXX_3.4 2025-05-07T20:11:07.6968715Z GLIBCXX_3.4.9 2025-05-07T20:11:07.6969798Z GLIBCXX_3.4.14 2025-05-07T20:11:07.6973656Z GLIBCXX_3.4.20 2025-05-07T20:11:07.6974206Z GLIBCXX_3.4.21 2025-05-07T20:11:07.6974421Z GLIBCXX_3.4.29 2025-05-07T20:11:07.6974541Z 2025-05-07T20:11:07.6974545Z 2025-05-07T20:11:07.6988135Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.T0JIADY91a.symbols.txt 2025-05-07T20:11:07.6989856Z 2025-05-07T20:11:07.7013660Z 2025-05-07T20:11:07.7036982Z [CHECK] Total Number of symbols: 505 2025-05-07T20:11:07.7051492Z [CHECK] Number of fbgemm symbols: 47 2025-05-07T20:11:07.7069531Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.I57hgOOgdL.usymbols.txt 2025-05-07T20:11:07.7070079Z 2025-05-07T20:11:07.7087315Z 2025-05-07T20:11:07.7111855Z [CHECK] Listing out undefined symbols (195 total): 2025-05-07T20:11:07.7127222Z U GOMP_barrier 2025-05-07T20:11:07.7128873Z U GOMP_parallel 2025-05-07T20:11:07.7131056Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7132266Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.7132623Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.7133011Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.7133401Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.7133781Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:07.7134247Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:07.7134590Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:07.7134988Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.7135338Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:07.7135649Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.7135954Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.7136255Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.7136569Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:07.7136887Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:07.7137187Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.7137509Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.7137804Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:07.7138102Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:07.7138388Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.7138696Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:07.7139172Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:07.7139724Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:07.7140172Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:07.7141058Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7141934Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:11:07.7142365Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:07.7142828Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:07.7143458Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:07.7144589Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:07.7145452Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:07.7146237Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7147038Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:07.7147357Z U at::get_num_threads() 2025-05-07T20:11:07.7147652Z U at::get_thread_num() 2025-05-07T20:11:07.7147933Z U at::in_parallel_region() 2025-05-07T20:11:07.7148236Z U at::init_num_threads() 2025-05-07T20:11:07.7148532Z U at::internal::set_thread_num(int) 2025-05-07T20:11:07.7148883Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:07.7149335Z U c10::BoolType::get() 2025-05-07T20:11:07.7149863Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:07.7150603Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:07.7151209Z U c10::Error::what() const 2025-05-07T20:11:07.7151586Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7152050Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7152489Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:07.7152860Z U c10::IntType::get() 2025-05-07T20:11:07.7153230Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:07.7153683Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:07.7154142Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:07.7154632Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:07.7155013Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:07.7155396Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:07.7155819Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:07.7156498Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:07.7157182Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:07.7157586Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:07.7157960Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:07.7158334Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:07.7158690Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:07.7159024Z U c10::SymIntType::get() 2025-05-07T20:11:07.7159397Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:07.7159766Z U c10::TensorType::get() 2025-05-07T20:11:07.7160109Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:07.7161091Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:07.7162185Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:07.7162598Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:07.7163117Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:07.7163841Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:07.7164385Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:07.7164756Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:07.7165095Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:07.7165416Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:07.7165754Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:07.7166204Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:07.7166666Z U c10::cuda::device_count() 2025-05-07T20:11:07.7166994Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:07.7167552Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:07.7167955Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:07.7168346Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:07.7168765Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:07.7169180Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:07.7170582Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:07.7171826Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:07.7172959Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.7173960Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:07.7175093Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.7175941Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:07.7176306Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:07.7176654Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:07.7177029Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:07.7177400Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:07.7177780Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:07.7178198Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:07.7178600Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:07.7178999Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:07.7192685Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:07.7193187Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:07.7193619Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:07.7194098Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:07.7194483Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:07.7194870Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:07.7195245Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:07.7195600Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:07.7195958Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:07.7196314Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:07.7196685Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:07.7197036Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:07.7197394Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:07.7197893Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:07.7198252Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:07.7198678Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:07.7199715Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7201524Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7203316Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7205350Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7207529Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7209385Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7211254Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7213178Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7215039Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7216912Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7219767Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7221661Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:07.7222867Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:11:07.7223327Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:07.7223835Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:07.7224363Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7224796Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7225236Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7225642Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7226128Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:07.7226562Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7226975Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7227338Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.7227645Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.7227941Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.7228242Z U omp_get_max_threads 2025-05-07T20:11:07.7228532Z U omp_get_num_threads 2025-05-07T20:11:07.7228838Z U omp_get_thread_num 2025-05-07T20:11:07.7229306Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.7229706Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.7230388Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:07.7231404Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:07.7232287Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:07.7232921Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:07.7233330Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:07.7233725Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.7234093Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:07.7234529Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7234941Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7235372Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:07.7235929Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:07.7236654Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:07.7237752Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7239009Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7239787Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:07.7240173Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:07.7240553Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.7240910Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.7241276Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.7241730Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.7242082Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:07.7242429Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:07.7242830Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7243381Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7243863Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:07.7244617Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:11:07.7245596Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:07.7246734Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:07.7247590Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:07.7248042Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.7248357Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:07.7249196Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:07.7250379Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.7251260Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.7252009Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:07.7252590Z U typeinfo for c10::Error 2025-05-07T20:11:07.7252935Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:07.7253369Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7253821Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.7254256Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:07.7254712Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.7255096Z U vtable for c10::Error 2025-05-07T20:11:07.7255650Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7256465Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7257125Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:07.7257662Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:07.7258129Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.7258462Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.7258768Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.7259075Z w __gmon_start__ 2025-05-07T20:11:07.7259345Z w __pthread_key_create 2025-05-07T20:11:07.7259701Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.7260156Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.7260490Z 2025-05-07T20:11:07.7260640Z linux-vdso.so.1 (0x00007ffd719e1000) 2025-05-07T20:11:07.7260948Z libc10.so => not found 2025-05-07T20:11:07.7261189Z libc10_cuda.so => not found 2025-05-07T20:11:07.7261740Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f8b60a00000) 2025-05-07T20:11:07.7262663Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f8b613e7000) 2025-05-07T20:11:07.7263326Z libtorch.so => not found 2025-05-07T20:11:07.7263596Z libtorch_cpu.so => not found 2025-05-07T20:11:07.7263862Z libtorch_cuda.so => not found 2025-05-07T20:11:07.7264151Z libcudart.so.12 => not found 2025-05-07T20:11:07.7264475Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8b6079c000) 2025-05-07T20:11:07.7264938Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8b613b7000) 2025-05-07T20:11:07.7265317Z libc.so.6 => /lib64/libc.so.6 (0x00007f8b60594000) 2025-05-07T20:11:07.7265715Z /lib64/ld-linux-x86-64.so.2 (0x00007f8b613f7000) 2025-05-07T20:11:07.7266034Z libc10.so => not found 2025-05-07T20:11:07.7266568Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f8b6133a000) 2025-05-07T20:11:07.7267142Z libtorch.so => not found 2025-05-07T20:11:07.7267394Z libtorch_cpu.so => not found 2025-05-07T20:11:07.7267674Z libtorch_cuda.so => not found 2025-05-07T20:11:07.7267966Z libm.so.6 => /lib64/libm.so.6 (0x00007f8b604b9000) 2025-05-07T20:11:07.7268301Z libc10.so => not found 2025-05-07T20:11:07.7268540Z libtorch_cpu.so => not found 2025-05-07T20:11:07.7268820Z libtorch_cuda.so => not found 2025-05-07T20:11:07.7269160Z libtorch.so => not found 2025-05-07T20:11:07.7269592Z libtorch_cpu.so => not found 2025-05-07T20:11:07.7269877Z libtorch_cuda.so => not found 2025-05-07T20:11:07.7270140Z libtorch.so => not found 2025-05-07T20:11:07.7270310Z 2025-05-07T20:11:07.7270434Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.7270899Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:07.7271266Z 2025-05-07T20:11:07.7271270Z 2025-05-07T20:11:07.7271430Z Dynamic section at offset 0x2c4138 contains 40 entries: 2025-05-07T20:11:07.7271810Z Tag Type Name/Value 2025-05-07T20:11:07.7272237Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.7272762Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:07.7273270Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:07.7273812Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:07.7274368Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.7274902Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.7275429Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.7275974Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:07.7276513Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.7277032Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.7277550Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.7278071Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:07.7278644Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:07.7279172Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:07.7279597Z 0x000000000000000c (INIT) 0x13000 2025-05-07T20:11:07.7279944Z 0x000000000000000d (FINI) 0x7422c 2025-05-07T20:11:07.7280278Z 0x0000000000000019 (INIT_ARRAY) 0x2c4cf8 2025-05-07T20:11:07.7280642Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:11:07.7280993Z 0x000000000000001a (FINI_ARRAY) 0x2c4d40 2025-05-07T20:11:07.7281363Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.7281795Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:07.7282112Z 0x000000006ffffef5 (GNU_HASH) 0x18b0 2025-05-07T20:11:07.7282433Z 0x0000000000000005 (STRTAB) 0x5790 2025-05-07T20:11:07.7282735Z 0x0000000000000006 (SYMTAB) 0x2820 2025-05-07T20:11:07.7283072Z 0x000000000000000a (STRSZ) 40152 (bytes) 2025-05-07T20:11:07.7283405Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.7283740Z 0x0000000000000003 (PLTGOT) 0x2c53f8 2025-05-07T20:11:07.7284073Z 0x0000000000000002 (PLTRELSZ) 6768 (bytes) 2025-05-07T20:11:07.7284850Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.7285303Z 0x0000000000000017 (JMPREL) 0x10f38 2025-05-07T20:11:07.7285652Z 0x0000000000000007 (RELA) 0xf990 2025-05-07T20:11:07.7286060Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:11:07.7286416Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.7286747Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.7287089Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.7287447Z 0x000000006ffffffe (VERNEED) 0xf860 2025-05-07T20:11:07.7287789Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:07.7288112Z 0x000000006ffffff0 (VERSYM) 0xf468 2025-05-07T20:11:07.7288443Z 0x000000006ffffff9 (RELACOUNT) 17 2025-05-07T20:11:07.7288763Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.7288975Z 2025-05-07T20:11:07.7289092Z ################################################################################ 2025-05-07T20:11:07.7289336Z 2025-05-07T20:11:07.7289340Z 2025-05-07T20:11:07.7289452Z ################################################################################ 2025-05-07T20:11:07.7289983Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7290512Z [CHECK] Listing out library size: 2025-05-07T20:11:07.7291273Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7291854Z 2025-05-07T20:11:07.7292063Z 21 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7292618Z 2025-05-07T20:11:07.7293364Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7294352Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.7295010Z 2025-05-07T20:11:07.7295119Z GLIBC_2.2.5 2025-05-07T20:11:07.7295335Z GLIBC_2.14 2025-05-07T20:11:07.7295455Z 2025-05-07T20:11:07.7295459Z 2025-05-07T20:11:07.7295863Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7296891Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.7297504Z 2025-05-07T20:11:07.7372094Z GLIBCXX_3.4 2025-05-07T20:11:07.7372517Z GLIBCXX_3.4.9 2025-05-07T20:11:07.7372931Z GLIBCXX_3.4.11 2025-05-07T20:11:07.7373225Z GLIBCXX_3.4.20 2025-05-07T20:11:07.7373447Z GLIBCXX_3.4.21 2025-05-07T20:11:07.7373651Z GLIBCXX_3.4.29 2025-05-07T20:11:07.7373792Z 2025-05-07T20:11:07.7373796Z 2025-05-07T20:11:07.7389396Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.DkKfYJ4mOE.symbols.txt 2025-05-07T20:11:07.7389922Z 2025-05-07T20:11:07.7432262Z 2025-05-07T20:11:07.7458858Z [CHECK] Total Number of symbols: 811 2025-05-07T20:11:07.7471696Z [CHECK] Number of fbgemm symbols: 80 2025-05-07T20:11:07.7486750Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.pfGwNbyzyV.usymbols.txt 2025-05-07T20:11:07.7487255Z 2025-05-07T20:11:07.7506786Z 2025-05-07T20:11:07.7540155Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:11:07.7558389Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7558972Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.7559343Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.7559747Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.7560154Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.7560545Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:07.7561065Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:07.7561444Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:07.7561819Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.7562231Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.7562550Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.7562882Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.7563196Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:07.7563528Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.7563852Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.7564184Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:07.7564515Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.7564872Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:07.7565331Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:07.7566124Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7567378Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7568777Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7569851Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:07.7570822Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7571972Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:07.7572660Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:07.7573627Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7574818Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.7575681Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:07.7576111Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:07.7576457Z U c10::BoolType::get() 2025-05-07T20:11:07.7576833Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:07.7577234Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:07.7577637Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7578081Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:07.7578428Z U c10::IntType::get() 2025-05-07T20:11:07.7578858Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:07.7579354Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:07.7579776Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:07.7580458Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:07.7581140Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:07.7581517Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:07.7581893Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:07.7582293Z U c10::TensorType::get() 2025-05-07T20:11:07.7582628Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:07.7583762Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:07.7584961Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:07.7585355Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:07.7585771Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:07.7586139Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:07.7586490Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:07.7586851Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:07.7587379Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:07.7587878Z U c10::cuda::current_device() 2025-05-07T20:11:07.7588211Z U c10::cuda::device_count() 2025-05-07T20:11:07.7588558Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:07.7589044Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:07.7589452Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:07.7589868Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:07.7590282Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:07.7590732Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:07.7591511Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:07.7592427Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:07.7593335Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.7594332Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:07.7594922Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:07.7595280Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:07.7595652Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:07.7596108Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:07.7596536Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:07.7596910Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:07.7597328Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:07.7597701Z U c10::throwNullDataPtrError() 2025-05-07T20:11:07.7598046Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:07.7598391Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:07.7598814Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:07.7599265Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:07.7599628Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:07.7600013Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:07.7600390Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:07.7600770Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:07.7601163Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:07.7601529Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:07.7601913Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:07.7602270Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:07.7602646Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:07.7602997Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:07.7603358Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:07.7603705Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:07.7604068Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:07.7604425Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:07.7604767Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:07.7605132Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:07.7605648Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:07.7606195Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:07.7606571Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:07.7606920Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:07.7607274Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:07.7607649Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:07.7608057Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7608464Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7608869Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7609223Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:07.7609609Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:07.7610080Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.7610480Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7610861Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.7611152Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.7611459Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.7611803Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.7612210Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.7612813Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:07.7613685Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:07.7614722Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:07.7615313Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.7615702Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7616111Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7616543Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:07.7616977Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:07.7617471Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:07.7618207Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:07.7619480Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7620276Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.7620611Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.7622668Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.7623003Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.7623330Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:07.7623643Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:07.7624031Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7624537Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7624976Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:07.7625284Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.7625584Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:07.7626383Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:07.7627562Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.7628357Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.7629146Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:07.7630084Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7630591Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.7631074Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:07.7631514Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.7632169Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7633001Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7633662Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:07.7634225Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:07.7634682Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.7635018Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.7635341Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.7635737Z w __gmon_start__ 2025-05-07T20:11:07.7636005Z w __pthread_key_create 2025-05-07T20:11:07.7636295Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:07.7636615Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:07.7636975Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.7637405Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7637696Z 2025-05-07T20:11:07.7637828Z linux-vdso.so.1 (0x00007ffddb3d0000) 2025-05-07T20:11:07.7638109Z libtorch.so => not found 2025-05-07T20:11:07.7638356Z libc10.so => not found 2025-05-07T20:11:07.7638582Z libc10_cuda.so => not found 2025-05-07T20:11:07.7638838Z libtorch_cpu.so => not found 2025-05-07T20:11:07.7639092Z libtorch_cuda.so => not found 2025-05-07T20:11:07.7639520Z libcudart.so.12 => not found 2025-05-07T20:11:07.7639855Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f875999c000) 2025-05-07T20:11:07.7640243Z libm.so.6 => /lib64/libm.so.6 (0x00007f87598c1000) 2025-05-07T20:11:07.7640628Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f875b2ef000) 2025-05-07T20:11:07.7641205Z libc.so.6 => /lib64/libc.so.6 (0x00007f87596b9000) 2025-05-07T20:11:07.7641579Z /lib64/ld-linux-x86-64.so.2 (0x00007f875b323000) 2025-05-07T20:11:07.7641818Z 2025-05-07T20:11:07.7641959Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.7642476Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:07.7642820Z 2025-05-07T20:11:07.7642872Z 2025-05-07T20:11:07.7643041Z Dynamic section at offset 0x14c3b48 contains 37 entries: 2025-05-07T20:11:07.7643450Z Tag Type Name/Value 2025-05-07T20:11:07.7643886Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.7644406Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.7644913Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:07.7645452Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.7646017Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.7646555Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:07.7647113Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.7647638Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:07.7648157Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.7648672Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.7649194Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:07.7649652Z 0x000000000000000c (INIT) 0x2a000 2025-05-07T20:11:07.7649978Z 0x000000000000000d (FINI) 0xe445c 2025-05-07T20:11:07.7650319Z 0x0000000000000019 (INIT_ARRAY) 0x14c31b0 2025-05-07T20:11:07.7650703Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:11:07.7651067Z 0x000000000000001a (FINI_ARRAY) 0x14c3280 2025-05-07T20:11:07.7651517Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.7651839Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:07.7652261Z 0x000000006ffffef5 (GNU_HASH) 0x1eb8 2025-05-07T20:11:07.7652567Z 0x0000000000000005 (STRTAB) 0x8730 2025-05-07T20:11:07.7652871Z 0x0000000000000006 (SYMTAB) 0x3b10 2025-05-07T20:11:07.7653193Z 0x000000000000000a (STRSZ) 113475 (bytes) 2025-05-07T20:11:07.7653531Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.7653844Z 0x0000000000000003 (PLTGOT) 0x14c3de8 2025-05-07T20:11:07.7654176Z 0x0000000000000002 (PLTRELSZ) 8736 (bytes) 2025-05-07T20:11:07.7654499Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.7654791Z 0x0000000000000017 (JMPREL) 0x27c90 2025-05-07T20:11:07.7655099Z 0x0000000000000007 (RELA) 0x249f0 2025-05-07T20:11:07.7655409Z 0x0000000000000008 (RELASZ) 12960 (bytes) 2025-05-07T20:11:07.7655743Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.7656209Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.7656525Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.7657039Z 0x000000006ffffffe (VERNEED) 0x248d0 2025-05-07T20:11:07.7657363Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:07.7657689Z 0x000000006ffffff0 (VERSYM) 0x24274 2025-05-07T20:11:07.7658012Z 0x000000006ffffff9 (RELACOUNT) 39 2025-05-07T20:11:07.7658331Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.7658531Z 2025-05-07T20:11:07.7658642Z ################################################################################ 2025-05-07T20:11:07.7658882Z 2025-05-07T20:11:07.7658887Z 2025-05-07T20:11:07.7658996Z ################################################################################ 2025-05-07T20:11:07.7659540Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7660037Z [CHECK] Listing out library size: 2025-05-07T20:11:07.7660531Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7660918Z 2025-05-07T20:11:07.7661130Z 9 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7661775Z 2025-05-07T20:11:07.7662183Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7663216Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.7663835Z 2025-05-07T20:11:07.7722891Z GLIBC_2.2.5 2025-05-07T20:11:07.7723231Z GLIBC_2.3 2025-05-07T20:11:07.7723881Z GLIBC_2.14 2025-05-07T20:11:07.7726820Z 2025-05-07T20:11:07.7726825Z 2025-05-07T20:11:07.7727310Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7728440Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.7730437Z 2025-05-07T20:11:07.7787809Z GLIBCXX_3.4 2025-05-07T20:11:07.7788047Z GLIBCXX_3.4.9 2025-05-07T20:11:07.7788269Z GLIBCXX_3.4.11 2025-05-07T20:11:07.7788479Z GLIBCXX_3.4.18 2025-05-07T20:11:07.7788673Z GLIBCXX_3.4.21 2025-05-07T20:11:07.7788877Z GLIBCXX_3.4.29 2025-05-07T20:11:07.7789102Z 2025-05-07T20:11:07.7789106Z 2025-05-07T20:11:07.7812083Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.z9F3Xm3vLe.symbols.txt 2025-05-07T20:11:07.7812610Z 2025-05-07T20:11:07.7837172Z 2025-05-07T20:11:07.7862655Z [CHECK] Total Number of symbols: 342 2025-05-07T20:11:07.7874559Z [CHECK] Number of fbgemm symbols: 14 2025-05-07T20:11:07.7890925Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.1pplmiP2VS.usymbols.txt 2025-05-07T20:11:07.7892504Z 2025-05-07T20:11:07.7910215Z 2025-05-07T20:11:07.7934313Z [CHECK] Listing out undefined symbols (129 total): 2025-05-07T20:11:07.7950321Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7953939Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7955428Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.7955779Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.7956277Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.7956647Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.7957029Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:07.7957491Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:07.7957825Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:07.7958165Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.7958479Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.7958765Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.7959052Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.7959339Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:07.7959618Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:07.7959917Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.7960205Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:07.7960519Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:07.7960897Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:07.7962640Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:07.7963101Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:07.7963447Z U c10::BoolType::get() 2025-05-07T20:11:07.7963666Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:07.7963762Z U c10::FloatType::get() 2025-05-07T20:11:07.7963869Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:07.7964050Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7964181Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:07.7964270Z U c10::IntType::get() 2025-05-07T20:11:07.7964436Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:07.7964549Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:07.7964687Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:07.7964836Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:07.7965230Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:07.7965399Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:07.7965499Z U c10::TensorType::get() 2025-05-07T20:11:07.7965614Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:07.7966309Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:07.7966444Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:07.7966746Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:07.7966863Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:07.7966979Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:07.7967090Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:07.7967201Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:07.7967454Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:07.7967548Z U c10::cuda::device_count() 2025-05-07T20:11:07.7967675Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:07.7967801Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:07.7967940Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:07.7968068Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:07.7968224Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:07.7968338Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:07.7968831Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:07.7969077Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:07.7969558Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.7969880Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:07.7969997Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:07.7970096Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:07.7970208Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:07.7970367Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:07.7970506Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:07.7970603Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:07.7970809Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:07.7970938Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:07.7971061Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:07.7971166Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:07.7971290Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:07.7971396Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:07.7971499Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:07.7971626Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:07.7971757Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:07.7971866Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:07.7971970Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:07.7972112Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:07.7972217Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:07.7972334Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:07.7972456Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:07.7972565Z U float at::Tensor::item() const 2025-05-07T20:11:07.7972712Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7972855Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7972996Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.7973114Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.7973198Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.7973299Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.7973436Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.7973552Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.7973897Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:07.7974270Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:07.7974591Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:07.7974953Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:07.7975065Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.7975175Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:07.7975322Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7975452Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7975576Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:07.7975812Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:07.7976141Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:07.7976703Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7977228Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.7977343Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.7977465Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.7977598Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.7977709Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.7977826Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:07.7977928Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:07.7978099Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7978333Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.7978449Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:07.7978551Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:07.7978642Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.7978775Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:07.7979347Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:07.7979832Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.7980081Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.7980430Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:07.7980591Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.7980768Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:07.7980924Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.7981272Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7981592Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7981921Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.7982126Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:07.7982343Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:07.7982450Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.7982564Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.7982663Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.7982750Z w __gmon_start__ 2025-05-07T20:11:07.7982843Z w __pthread_key_create 2025-05-07T20:11:07.7982961Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:07.7983066Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:07.7983204Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.7983425Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7983432Z 2025-05-07T20:11:07.7987738Z linux-vdso.so.1 (0x00007ffeba71d000) 2025-05-07T20:11:07.7987919Z libtorch.so => not found 2025-05-07T20:11:07.7988107Z libc10.so => not found 2025-05-07T20:11:07.7988287Z libc10_cuda.so => not found 2025-05-07T20:11:07.7988436Z libtorch_cpu.so => not found 2025-05-07T20:11:07.7988548Z libtorch_cuda.so => not found 2025-05-07T20:11:07.7988653Z libcudart.so.12 => not found 2025-05-07T20:11:07.7988823Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f58d2d9c000) 2025-05-07T20:11:07.7989145Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f58d399d000) 2025-05-07T20:11:07.7989291Z libc.so.6 => /lib64/libc.so.6 (0x00007f58d2b94000) 2025-05-07T20:11:07.7989463Z /lib64/ld-linux-x86-64.so.2 (0x00007f58d39d1000) 2025-05-07T20:11:07.7989592Z libm.so.6 => /lib64/libm.so.6 (0x00007f58d2ab9000) 2025-05-07T20:11:07.7990193Z 2025-05-07T20:11:07.7991414Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.7991913Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:07.7991921Z 2025-05-07T20:11:07.8022154Z 2025-05-07T20:11:07.8022697Z Dynamic section at offset 0x8a8558 contains 37 entries: 2025-05-07T20:11:07.8022931Z Tag Type Name/Value 2025-05-07T20:11:07.8023362Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.8023604Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.8023811Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:07.8024019Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.8024388Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.8024596Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:07.8024799Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.8024997Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.8025204Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.8025425Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:07.8025668Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:11:07.8025854Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:11:07.8025969Z 0x000000000000000d (FINI) 0x3464c 2025-05-07T20:11:07.8026090Z 0x0000000000000019 (INIT_ARRAY) 0x8a82d8 2025-05-07T20:11:07.8026231Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:11:07.8026355Z 0x000000000000001a (FINI_ARRAY) 0x8a8308 2025-05-07T20:11:07.8026480Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.8026591Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:07.8026725Z 0x000000006ffffef5 (GNU_HASH) 0x1218 2025-05-07T20:11:07.8026835Z 0x0000000000000005 (STRTAB) 0x3d10 2025-05-07T20:11:07.8026944Z 0x0000000000000006 (SYMTAB) 0x1ce8 2025-05-07T20:11:07.8027091Z 0x000000000000000a (STRSZ) 36563 (bytes) 2025-05-07T20:11:07.8027213Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.8027330Z 0x0000000000000003 (PLTGOT) 0x8a87f8 2025-05-07T20:11:07.8027480Z 0x0000000000000002 (PLTRELSZ) 3600 (bytes) 2025-05-07T20:11:07.8027588Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.8027698Z 0x0000000000000017 (JMPREL) 0xec00 2025-05-07T20:11:07.8027804Z 0x0000000000000007 (RELA) 0xcfc8 2025-05-07T20:11:07.8027950Z 0x0000000000000008 (RELASZ) 7224 (bytes) 2025-05-07T20:11:07.8028069Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.8028168Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.8028306Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.8028420Z 0x000000006ffffffe (VERNEED) 0xce98 2025-05-07T20:11:07.8028526Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:07.8028639Z 0x000000006ffffff0 (VERSYM) 0xcbe4 2025-05-07T20:11:07.8028759Z 0x000000006ffffff9 (RELACOUNT) 90 2025-05-07T20:11:07.8028860Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.8028867Z 2025-05-07T20:11:07.8029083Z ################################################################################ 2025-05-07T20:11:07.8029126Z 2025-05-07T20:11:07.8029131Z 2025-05-07T20:11:07.8029259Z ################################################################################ 2025-05-07T20:11:07.8029681Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8029788Z [CHECK] Listing out library size: 2025-05-07T20:11:07.8030070Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8030075Z 2025-05-07T20:11:07.8033361Z 17 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8034475Z 2025-05-07T20:11:07.8035270Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8035776Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.8035785Z 2025-05-07T20:11:07.8091905Z GLIBC_2.2.5 2025-05-07T20:11:07.8092997Z GLIBC_2.14 2025-05-07T20:11:07.8094422Z 2025-05-07T20:11:07.8094435Z 2025-05-07T20:11:07.8095703Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8097170Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.8097190Z 2025-05-07T20:11:07.8151750Z GLIBCXX_3.4 2025-05-07T20:11:07.8152035Z GLIBCXX_3.4.9 2025-05-07T20:11:07.8152404Z GLIBCXX_3.4.20 2025-05-07T20:11:07.8152503Z GLIBCXX_3.4.21 2025-05-07T20:11:07.8152586Z GLIBCXX_3.4.29 2025-05-07T20:11:07.8152592Z 2025-05-07T20:11:07.8152596Z 2025-05-07T20:11:07.8173840Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.Migyy8HD0p.symbols.txt 2025-05-07T20:11:07.8174265Z 2025-05-07T20:11:07.8198446Z 2025-05-07T20:11:07.8225542Z [CHECK] Total Number of symbols: 469 2025-05-07T20:11:07.8236396Z [CHECK] Number of fbgemm symbols: 12 2025-05-07T20:11:07.8255836Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.hS0bvZENRU.usymbols.txt 2025-05-07T20:11:07.8255869Z 2025-05-07T20:11:07.8271087Z 2025-05-07T20:11:07.8303637Z [CHECK] Listing out undefined symbols (155 total): 2025-05-07T20:11:07.8323192Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.8323484Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.8323661Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.8323808Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.8323961Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.8324100Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:07.8324247Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:07.8324370Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:07.8324524Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.8324661Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:07.8324766Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.8324890Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.8324998Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.8325111Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:07.8325270Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.8325375Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.8325488Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:07.8325587Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:07.8325699Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.8326110Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:07.8326285Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:07.8326953Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8327649Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8327819Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:07.8328000Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:07.8328198Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:07.8328430Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:07.8328546Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:07.8329101Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8329707Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8329825Z U c10::BoolType::get() 2025-05-07T20:11:07.8329993Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:07.8330138Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:07.8330291Z U c10::IntType::get() 2025-05-07T20:11:07.8330469Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:07.8330597Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:07.8330840Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:07.8331123Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:07.8331369Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:07.8331773Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:07.8331905Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:07.8332017Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:07.8332123Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:07.8332233Z U c10::SymIntType::get() 2025-05-07T20:11:07.8332382Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:07.8332532Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:07.8332647Z U c10::TensorType::get() 2025-05-07T20:11:07.8332762Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:07.8333448Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:07.8333576Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:07.8333687Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:07.8333801Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:07.8333922Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:07.8334090Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:07.8334197Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:07.8334542Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:07.8334644Z U c10::cuda::current_device() 2025-05-07T20:11:07.8334762Z U c10::cuda::device_count() 2025-05-07T20:11:07.8334907Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:07.8335032Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:07.8335165Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:07.8335307Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:07.8335455Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:07.8335561Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:07.8336073Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:07.8336319Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:07.8336813Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.8337147Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:07.8337709Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.8337816Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:07.8337931Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:07.8338093Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:07.8338252Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:07.8338372Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:07.8338513Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:07.8338643Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:07.8338766Z U c10::throwNullDataPtrError() 2025-05-07T20:11:07.8338865Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:07.8338974Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:07.8339175Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:07.8339289Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:07.8339414Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:07.8339535Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:07.8339669Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:07.8339774Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:07.8339893Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:07.8340012Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:07.8340119Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:07.8340235Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:07.8340361Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:07.8340465Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:07.8340578Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:07.8340704Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:07.8340824Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:07.8340935Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:07.8341039Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:07.8341174Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:07.8341281Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:07.8341572Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:07.8341695Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:07.8341801Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:07.8341906Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:07.8342021Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:07.8342140Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:07.8342259Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8342389Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.8342493Z U log2@GLIBC_2.2.5 2025-05-07T20:11:07.8342663Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:07.8342786Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8342963Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:07.8343055Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.8343145Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.8343244Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.8343390Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.8343508Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.8343834Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:07.8344218Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:07.8344567Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:07.8344687Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.8344826Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.8344957Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.8345133Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:07.8345361Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:07.8345693Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:07.8346257Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.8346380Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:07.8346493Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.8346620Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.8346726Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.8346833Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.8346947Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:07.8347053Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:07.8347223Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.8347459Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.8347580Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:07.8347682Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:07.8347792Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.8347921Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:07.8348517Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:07.8349066Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.8349314Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.8349855Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:07.8350059Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:07.8350222Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.8350393Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:07.8350597Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.8350962Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.8351304Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.8351528Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:07.8351759Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:07.8351874Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.8352023Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.8352128Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.8352218Z w __gmon_start__ 2025-05-07T20:11:07.8352315Z w __pthread_key_create 2025-05-07T20:11:07.8352479Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.8352683Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8352690Z 2025-05-07T20:11:07.8367324Z linux-vdso.so.1 (0x00007ffed2943000) 2025-05-07T20:11:07.8367639Z libtorch.so => not found 2025-05-07T20:11:07.8367913Z libc10.so => not found 2025-05-07T20:11:07.8368181Z libc10_cuda.so => not found 2025-05-07T20:11:07.8368507Z libtorch_cpu.so => not found 2025-05-07T20:11:07.8368778Z libtorch_cuda.so => not found 2025-05-07T20:11:07.8369044Z libcudart.so.12 => not found 2025-05-07T20:11:07.8369542Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fee8df9c000) 2025-05-07T20:11:07.8369942Z libm.so.6 => /lib64/libm.so.6 (0x00007fee8f41e000) 2025-05-07T20:11:07.8370405Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fee8f3f0000) 2025-05-07T20:11:07.8370769Z libc.so.6 => /lib64/libc.so.6 (0x00007fee8dd94000) 2025-05-07T20:11:07.8371160Z /lib64/ld-linux-x86-64.so.2 (0x00007fee8f4ff000) 2025-05-07T20:11:07.8372475Z 2025-05-07T20:11:07.8372811Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.8373479Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:07.8373485Z 2025-05-07T20:11:07.8411764Z 2025-05-07T20:11:07.8412335Z Dynamic section at offset 0x106d2d0 contains 37 entries: 2025-05-07T20:11:07.8412468Z Tag Type Name/Value 2025-05-07T20:11:07.8412692Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.8412891Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.8413093Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:07.8413308Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.8413663Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.8413871Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:07.8414127Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.8414331Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:07.8414524Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.8414716Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.8414951Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:07.8415063Z 0x000000000000000c (INIT) 0x12000 2025-05-07T20:11:07.8415175Z 0x000000000000000d (FINI) 0xa2d3c 2025-05-07T20:11:07.8415296Z 0x0000000000000019 (INIT_ARRAY) 0x106de30 2025-05-07T20:11:07.8415433Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:11:07.8415550Z 0x000000000000001a (FINI_ARRAY) 0x106de90 2025-05-07T20:11:07.8415687Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.8415848Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:07.8415966Z 0x000000006ffffef5 (GNU_HASH) 0x1640 2025-05-07T20:11:07.8416074Z 0x0000000000000005 (STRTAB) 0x51f0 2025-05-07T20:11:07.8416195Z 0x0000000000000006 (SYMTAB) 0x25e0 2025-05-07T20:11:07.8416323Z 0x000000000000000a (STRSZ) 38760 (bytes) 2025-05-07T20:11:07.8416439Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.8416573Z 0x0000000000000003 (PLTGOT) 0x106e570 2025-05-07T20:11:07.8416702Z 0x0000000000000002 (PLTRELSZ) 5376 (bytes) 2025-05-07T20:11:07.8416804Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.8416912Z 0x0000000000000017 (JMPREL) 0x10600 2025-05-07T20:11:07.8417207Z 0x0000000000000007 (RELA) 0xee18 2025-05-07T20:11:07.8417331Z 0x0000000000000008 (RELASZ) 6120 (bytes) 2025-05-07T20:11:07.8417444Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.8417556Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.8417673Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.8417782Z 0x000000006ffffffe (VERNEED) 0xed08 2025-05-07T20:11:07.8417888Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:07.8418114Z 0x000000006ffffff0 (VERSYM) 0xe958 2025-05-07T20:11:07.8418207Z 0x000000006ffffff9 (RELACOUNT) 26 2025-05-07T20:11:07.8418300Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.8418305Z 2025-05-07T20:11:07.8418425Z ################################################################################ 2025-05-07T20:11:07.8418430Z 2025-05-07T20:11:07.8418434Z 2025-05-07T20:11:07.8418532Z ################################################################################ 2025-05-07T20:11:07.8418824Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8418933Z [CHECK] Listing out library size: 2025-05-07T20:11:07.8419215Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8419220Z 2025-05-07T20:11:07.8431163Z 2 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8432633Z 2025-05-07T20:11:07.8436029Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8436577Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.8436585Z 2025-05-07T20:11:07.8487027Z GLIBC_2.2.5 2025-05-07T20:11:07.8487125Z GLIBC_2.3 2025-05-07T20:11:07.8487218Z GLIBC_2.14 2025-05-07T20:11:07.8488594Z 2025-05-07T20:11:07.8488733Z 2025-05-07T20:11:07.8489920Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8490600Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.8490607Z 2025-05-07T20:11:07.8546007Z GLIBCXX_3.4 2025-05-07T20:11:07.8546220Z GLIBCXX_3.4.9 2025-05-07T20:11:07.8546381Z GLIBCXX_3.4.21 2025-05-07T20:11:07.8546537Z GLIBCXX_3.4.29 2025-05-07T20:11:07.8546547Z 2025-05-07T20:11:07.8546556Z 2025-05-07T20:11:07.8570682Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.6BYgRKivwM.symbols.txt 2025-05-07T20:11:07.8570788Z 2025-05-07T20:11:07.8589498Z 2025-05-07T20:11:07.8615568Z [CHECK] Total Number of symbols: 326 2025-05-07T20:11:07.8628707Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:07.8646975Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.U8mnmMsJPn.usymbols.txt 2025-05-07T20:11:07.8647022Z 2025-05-07T20:11:07.8664311Z 2025-05-07T20:11:07.8687874Z [CHECK] Listing out undefined symbols (143 total): 2025-05-07T20:11:07.8704220Z U GOMP_parallel 2025-05-07T20:11:07.8706175Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.8706653Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:07.8707258Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.8708068Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:07.8708790Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.8709689Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:07.8710375Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:07.8711098Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:07.8711495Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:07.8711788Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:07.8712113Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:07.8712412Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:07.8712706Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:07.8713031Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:07.8713322Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:07.8713628Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:07.8713906Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:07.8714492Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:07.8715892Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8716532Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8716703Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:07.8716809Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:07.8717280Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8717377Z U at::get_num_threads() 2025-05-07T20:11:07.8717470Z U at::get_thread_num() 2025-05-07T20:11:07.8717580Z U at::in_parallel_region() 2025-05-07T20:11:07.8717677Z U at::init_num_threads() 2025-05-07T20:11:07.8717788Z U at::internal::set_thread_num(int) 2025-05-07T20:11:07.8718426Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:07.8718693Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:07.8718865Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8719030Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:07.8719175Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8719315Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:07.8719451Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:07.8719570Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:07.8719722Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:07.8719857Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:07.8720042Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:07.8720184Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:07.8720279Z U c10::TensorType::get() 2025-05-07T20:11:07.8720402Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:07.8721092Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:07.8721233Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:07.8721342Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:07.8721478Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:07.8721590Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:07.8721716Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:07.8721822Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:07.8722060Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:07.8722171Z U c10::cuda::device_count() 2025-05-07T20:11:07.8722302Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:07.8722427Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:07.8722575Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:07.8722706Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:07.8722851Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:07.8722970Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:07.8723479Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:07.8723723Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:07.8724208Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:07.8724535Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:07.8724647Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:07.8724755Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:07.8724895Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:07.8725078Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:07.8725202Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:07.8725340Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:07.8725492Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:07.8725616Z U c10::throwNullDataPtrError() 2025-05-07T20:11:07.8725716Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:07.8725823Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:07.8726024Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:07.8726138Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:07.8726261Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:07.8726393Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:07.8726518Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:07.8726629Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:07.8726748Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:07.8726892Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:07.8726999Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:07.8727116Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:07.8727247Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:07.8727349Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:07.8727464Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:07.8727577Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:07.8727694Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:07.8727799Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:07.8728087Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:07.8728217Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:07.8728319Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:07.8728427Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:07.8728559Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:07.8728672Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:07.8728804Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8728934Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8729101Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:07.8729227Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:07.8729315Z U memcpy@GLIBC_2.14 2025-05-07T20:11:07.8729418Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:07.8729512Z U memset@GLIBC_2.2.5 2025-05-07T20:11:07.8729598Z U omp_get_num_threads 2025-05-07T20:11:07.8729698Z U omp_get_thread_num 2025-05-07T20:11:07.8729837Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:07.8729958Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:07.8730302Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:07.8730676Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:07.8731002Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:07.8731148Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:07.8731282Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:07.8731392Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:07.8731554Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.8731685Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:07.8731934Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:07.8732282Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:07.8732838Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.8732953Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:07.8733077Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:07.8733194Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:07.8733306Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.8733428Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:07.8733577Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:07.8733681Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:07.8733861Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:07.8734073Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:07.8734163Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:07.8734289Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:07.8734860Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:07.8735329Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.8735592Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:07.8735944Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:07.8736088Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:07.8736252Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:07.8736402Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:07.8736738Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.8737065Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:07.8737259Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:07.8737477Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:07.8737593Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:07.8737690Z w _ITM_registerTMCloneTable 2025-05-07T20:11:07.8737787Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:07.8737885Z w __gmon_start__ 2025-05-07T20:11:07.8737976Z w __pthread_key_create 2025-05-07T20:11:07.8738114Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:07.8738339Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8738358Z 2025-05-07T20:11:07.8746757Z linux-vdso.so.1 (0x00007ffc60df1000) 2025-05-07T20:11:07.8747319Z libc10.so => not found 2025-05-07T20:11:07.8747831Z libc10_cuda.so => not found 2025-05-07T20:11:07.8748575Z libtorch.so => not found 2025-05-07T20:11:07.8748916Z libtorch_cpu.so => not found 2025-05-07T20:11:07.8749405Z libtorch_cuda.so => not found 2025-05-07T20:11:07.8749710Z libcudart.so.12 => not found 2025-05-07T20:11:07.8750319Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff08a905000) 2025-05-07T20:11:07.8750770Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff08a8d7000) 2025-05-07T20:11:07.8751155Z libc.so.6 => /lib64/libc.so.6 (0x00007ff08a6cf000) 2025-05-07T20:11:07.8751537Z /lib64/ld-linux-x86-64.so.2 (0x00007ff08ad1f000) 2025-05-07T20:11:07.8751885Z libm.so.6 => /lib64/libm.so.6 (0x00007ff08a5f4000) 2025-05-07T20:11:07.8751903Z 2025-05-07T20:11:07.8752216Z [CHECK] Displaying ELF information: 2025-05-07T20:11:07.8753042Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:07.8753064Z 2025-05-07T20:11:07.8778684Z 2025-05-07T20:11:07.8779191Z Dynamic section at offset 0x179670 contains 38 entries: 2025-05-07T20:11:07.8779431Z Tag Type Name/Value 2025-05-07T20:11:07.8779829Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:07.8782068Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:07.8782290Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:07.8782524Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:07.8782741Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:07.8782970Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:07.8783171Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:07.8783389Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:07.8783629Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:07.8783851Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:07.8784124Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:07.8784313Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:07.8784648Z 0x000000000000000c (INIT) 0xc000 2025-05-07T20:11:07.8784781Z 0x000000000000000d (FINI) 0x237dc 2025-05-07T20:11:07.8784899Z 0x0000000000000019 (INIT_ARRAY) 0x1792c0 2025-05-07T20:11:07.8785024Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:11:07.8785142Z 0x000000000000001a (FINI_ARRAY) 0x1792e0 2025-05-07T20:11:07.8785277Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:07.8785382Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:07.8785539Z 0x000000006ffffef5 (GNU_HASH) 0x10f8 2025-05-07T20:11:07.8785663Z 0x0000000000000005 (STRTAB) 0x38a8 2025-05-07T20:11:07.8785771Z 0x0000000000000006 (SYMTAB) 0x1a00 2025-05-07T20:11:07.8785904Z 0x000000000000000a (STRSZ) 24404 (bytes) 2025-05-07T20:11:07.8786041Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:07.8786156Z 0x0000000000000003 (PLTGOT) 0x179910 2025-05-07T20:11:07.8786288Z 0x0000000000000002 (PLTRELSZ) 3864 (bytes) 2025-05-07T20:11:07.8786398Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:07.8786514Z 0x0000000000000017 (JMPREL) 0xaba8 2025-05-07T20:11:07.8786619Z 0x0000000000000007 (RELA) 0x9ba0 2025-05-07T20:11:07.8786746Z 0x0000000000000008 (RELASZ) 4104 (bytes) 2025-05-07T20:11:07.8786874Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:07.8786981Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:07.8787106Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:07.8787223Z 0x000000006ffffffe (VERNEED) 0x9a90 2025-05-07T20:11:07.8787439Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:07.8787555Z 0x000000006ffffff0 (VERSYM) 0x97fc 2025-05-07T20:11:07.8787661Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:11:07.8787822Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:07.8787828Z 2025-05-07T20:11:07.8787948Z ################################################################################ 2025-05-07T20:11:07.8787953Z 2025-05-07T20:11:07.8787957Z 2025-05-07T20:11:07.8788068Z ################################################################################ 2025-05-07T20:11:07.8788422Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:07.8788528Z [CHECK] Listing out library size: 2025-05-07T20:11:07.8788858Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:07.8788864Z 2025-05-07T20:11:07.8792056Z 8 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:07.8792789Z 2025-05-07T20:11:07.8793944Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:07.8794668Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.8794676Z 2025-05-07T20:11:07.9225568Z GLIBC_2.2.5 2025-05-07T20:11:07.9226572Z GLIBC_2.3 2025-05-07T20:11:07.9227052Z GLIBC_2.14 2025-05-07T20:11:07.9227090Z 2025-05-07T20:11:07.9227110Z 2025-05-07T20:11:07.9229560Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:07.9231321Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:07.9231634Z 2025-05-07T20:11:07.9658508Z GLIBCXX_3.4 2025-05-07T20:11:07.9659729Z GLIBCXX_3.4.9 2025-05-07T20:11:07.9660719Z GLIBCXX_3.4.11 2025-05-07T20:11:07.9661341Z GLIBCXX_3.4.15 2025-05-07T20:11:07.9661942Z GLIBCXX_3.4.18 2025-05-07T20:11:07.9662530Z GLIBCXX_3.4.20 2025-05-07T20:11:07.9663121Z GLIBCXX_3.4.21 2025-05-07T20:11:07.9663682Z GLIBCXX_3.4.29 2025-05-07T20:11:07.9664037Z 2025-05-07T20:11:07.9664071Z 2025-05-07T20:11:07.9675169Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.xFEY8Lz1vf.symbols.txt 2025-05-07T20:11:07.9675740Z 2025-05-07T20:11:08.0063277Z 2025-05-07T20:11:08.0088317Z [CHECK] Total Number of symbols: 4265 2025-05-07T20:11:08.0116110Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:08.0133759Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.NgXLOUTF3P.usymbols.txt 2025-05-07T20:11:08.0134518Z 2025-05-07T20:11:08.0165789Z 2025-05-07T20:11:08.0196493Z [CHECK] Listing out undefined symbols (190 total): 2025-05-07T20:11:08.0219577Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.0222658Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.0223646Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:08.0224634Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.0225340Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.0225678Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.0226007Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:08.0226361Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.0226692Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.0227030Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.0227357Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.0227683Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:08.0228190Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.0228527Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.0229133Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:08.0229568Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:08.0230020Z U at::RecordFunction::end() 2025-05-07T20:11:08.0230367Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:08.0230773Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:08.0231360Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:08.0232091Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:08.0232867Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:08.0233538Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:08.0234601Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.0235676Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:08.0236173Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.0236586Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:08.0237008Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:08.0237414Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:08.0237826Z U c10::AnyType::get() 2025-05-07T20:11:08.0238144Z U c10::BoolType::get() 2025-05-07T20:11:08.0238521Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:08.0239005Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:08.0239431Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:08.0240221Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:08.0241659Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:08.0242739Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.0243324Z U c10::Error::what() const 2025-05-07T20:11:08.0243614Z U c10::FloatType::get() 2025-05-07T20:11:08.0243921Z U c10::GradMode::is_enabled() 2025-05-07T20:11:08.0244247Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:08.0244611Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:08.0244997Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:08.0245319Z U c10::IValue::isBoolList() const 2025-05-07T20:11:08.0245652Z U c10::IValue::isDoubleList() const 2025-05-07T20:11:08.0245967Z U c10::IValue::isIntList() const 2025-05-07T20:11:08.0246295Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:08.0246623Z U c10::IValue::isTensorList() const 2025-05-07T20:11:08.0246968Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.0247320Z U c10::IntType::get() 2025-05-07T20:11:08.0248002Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.0248747Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.0249179Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.0249520Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.0249880Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.0250318Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.0250924Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:08.0251418Z U c10::StringType::get() 2025-05-07T20:11:08.0251759Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:08.0252161Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:08.0252741Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:08.0253184Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:08.0253921Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.0254590Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.0254998Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:08.0255386Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:08.0255774Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:08.0256127Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:08.0256491Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:08.0256850Z U c10::SymIntType::get() 2025-05-07T20:11:08.0257176Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:08.0257528Z U c10::TensorType::get() 2025-05-07T20:11:08.0271495Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.0272750Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.0274072Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.0275005Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.0275899Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.0276905Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.0278005Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.0279066Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:08.0279729Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:08.0280169Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.0280577Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:08.0281245Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.0281879Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:08.0282298Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:08.0282855Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:08.0283293Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:08.0283810Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.0284260Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:08.0285122Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:08.0285827Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.0286347Z U free@GLIBC_2.2.5 2025-05-07T20:11:08.0286734Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.0287135Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:08.0287448Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.0287738Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.0288042Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.0288390Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.0288898Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.0289238Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:08.0289670Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:08.0290376Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.0291252Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.0292136Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.0293006Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:08.0294057Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.0294968Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:08.0295610Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.0295961Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:08.0296351Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.0296761Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.0297207Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.0297768Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:08.0298133Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:08.0298612Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.0299291Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.0300298Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.0301461Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.0302179Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:08.0302534Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.0302921Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.0303258Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.0305343Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.0305691Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.0306031Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.0306413Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.0306944Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.0307418Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.0307791Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:08.0308200Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:08.0308851Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:08.0309822Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:08.0310267Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.0310589Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:08.0310900Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.0311329Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.0312622Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.0314271Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.0315183Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.0315711Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:08.0316268Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:08.0316876Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:08.0317408Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:08.0317929Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:08.0318609Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:08.0319254Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:08.0319721Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:08.0320233Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:08.0320658Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:08.0321027Z U torch::autograd::Node::metadata() 2025-05-07T20:11:08.0321406Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:08.0321912Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:08.0322583Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:08.0323119Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:08.0323612Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:08.0324188Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:08.0327691Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:08.0331254Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:08.0331682Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:08.0332143Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:08.0333277Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:08.0334414Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:08.0335131Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:08.0336064Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.0336664Z U typeinfo for c10::Error 2025-05-07T20:11:08.0337037Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.0337461Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:08.0337854Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:08.0338245Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:08.0338621Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:08.0339021Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.0339454Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.0339917Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:08.0340355Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.0340752Z U vtable for c10::Error 2025-05-07T20:11:08.0341336Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.0342163Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.0342777Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.0343257Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.0343816Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.0344288Z U vtable for torch::autograd::Node 2025-05-07T20:11:08.0344699Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.0345130Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.0345454Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.0345788Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.0346106Z w __gmon_start__ 2025-05-07T20:11:08.0346384Z w __pthread_key_create 2025-05-07T20:11:08.0346711Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:08.0347083Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:08.0347475Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.0348044Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:08.0348440Z 2025-05-07T20:11:08.0348599Z linux-vdso.so.1 (0x00007fff7a0bc000) 2025-05-07T20:11:08.0348907Z libc10.so => not found 2025-05-07T20:11:08.0349659Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f69ea800000) 2025-05-07T20:11:08.0350371Z libtorch.so => not found 2025-05-07T20:11:08.0351008Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f69eb3de000) 2025-05-07T20:11:08.0351678Z libtorch_cpu.so => not found 2025-05-07T20:11:08.0351969Z libtorch_cuda.so => not found 2025-05-07T20:11:08.0352318Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f69ea59c000) 2025-05-07T20:11:08.0352762Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f69eb3ae000) 2025-05-07T20:11:08.0353152Z libc.so.6 => /lib64/libc.so.6 (0x00007f69ea394000) 2025-05-07T20:11:08.0353571Z /lib64/ld-linux-x86-64.so.2 (0x00007f69eb3ee000) 2025-05-07T20:11:08.0353901Z libc10.so => not found 2025-05-07T20:11:08.0354157Z libc10_cuda.so => not found 2025-05-07T20:11:08.0354714Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f69e9e00000) 2025-05-07T20:11:08.0355297Z libtorch.so => not found 2025-05-07T20:11:08.0355571Z libtorch_cpu.so => not found 2025-05-07T20:11:08.0355851Z libtorch_cuda.so => not found 2025-05-07T20:11:08.0356134Z libcudart.so.12 => not found 2025-05-07T20:11:08.0356402Z libc10.so => not found 2025-05-07T20:11:08.0356663Z libtorch_cpu.so => not found 2025-05-07T20:11:08.0356936Z libtorch_cuda.so => not found 2025-05-07T20:11:08.0357252Z libtorch.so => not found 2025-05-07T20:11:08.0357540Z libm.so.6 => /lib64/libm.so.6 (0x00007f69e9d25000) 2025-05-07T20:11:08.0357885Z libc10.so => not found 2025-05-07T20:11:08.0358415Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f69eb32d000) 2025-05-07T20:11:08.0359002Z libtorch.so => not found 2025-05-07T20:11:08.0359271Z libtorch_cpu.so => not found 2025-05-07T20:11:08.0359545Z libtorch_cuda.so => not found 2025-05-07T20:11:08.0359838Z libtorch_cpu.so => not found 2025-05-07T20:11:08.0360110Z libtorch_cuda.so => not found 2025-05-07T20:11:08.0360389Z libtorch.so => not found 2025-05-07T20:11:08.0360556Z 2025-05-07T20:11:08.0360666Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.0361185Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:08.0361592Z 2025-05-07T20:11:08.0361596Z 2025-05-07T20:11:08.0361767Z Dynamic section at offset 0x701230 contains 38 entries: 2025-05-07T20:11:08.0362154Z Tag Type Name/Value 2025-05-07T20:11:08.0362590Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.0363127Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:08.0363690Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.0364243Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:08.0364786Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.0365327Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.0365854Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.0366387Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.0366912Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.0367451Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.0368103Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:11:08.0368698Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:08.0369144Z 0x000000000000000c (INIT) 0x178000 2025-05-07T20:11:08.0369507Z 0x000000000000000d (FINI) 0x65b3d8 2025-05-07T20:11:08.0369847Z 0x0000000000000019 (INIT_ARRAY) 0x6fcd78 2025-05-07T20:11:08.0370217Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:11:08.0370580Z 0x000000000000001a (FINI_ARRAY) 0x6fce78 2025-05-07T20:11:08.0370928Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.0371284Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:08.0371612Z 0x000000006ffffef5 (GNU_HASH) 0x6490 2025-05-07T20:11:08.0371966Z 0x0000000000000005 (STRTAB) 0x25438 2025-05-07T20:11:08.0372296Z 0x0000000000000006 (SYMTAB) 0xc448 2025-05-07T20:11:08.0372675Z 0x000000000000000a (STRSZ) 1180638 (bytes) 2025-05-07T20:11:08.0373065Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.0373449Z 0x0000000000000003 (PLTGOT) 0x7024d0 2025-05-07T20:11:08.0373831Z 0x0000000000000002 (PLTRELSZ) 20976 (bytes) 2025-05-07T20:11:08.0374185Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.0374536Z 0x0000000000000017 (JMPREL) 0x171f98 2025-05-07T20:11:08.0374871Z 0x0000000000000007 (RELA) 0x147aa0 2025-05-07T20:11:08.0375252Z 0x0000000000000008 (RELASZ) 173304 (bytes) 2025-05-07T20:11:08.0375614Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.0375953Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.0376271Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.0376619Z 0x000000006ffffffe (VERNEED) 0x147970 2025-05-07T20:11:08.0376992Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:08.0377327Z 0x000000006ffffff0 (VERSYM) 0x145816 2025-05-07T20:11:08.0377667Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:11:08.0377980Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.0378197Z 2025-05-07T20:11:08.0378311Z ################################################################################ 2025-05-07T20:11:08.0378544Z 2025-05-07T20:11:08.0378548Z 2025-05-07T20:11:08.0378679Z ################################################################################ 2025-05-07T20:11:08.0379186Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.0379697Z [CHECK] Listing out library size: 2025-05-07T20:11:08.0380159Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.0380555Z 2025-05-07T20:11:08.0380768Z 432 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.0381092Z 2025-05-07T20:11:08.0381510Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.0382531Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.0383174Z 2025-05-07T20:11:08.0733379Z GLIBC_2.2.5 2025-05-07T20:11:08.0733778Z GLIBC_2.3 2025-05-07T20:11:08.0734165Z GLIBC_2.14 2025-05-07T20:11:08.0734334Z 2025-05-07T20:11:08.0734338Z 2025-05-07T20:11:08.0734788Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.0735858Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.0736514Z 2025-05-07T20:11:08.1125196Z GLIBCXX_3.4 2025-05-07T20:11:08.1126043Z GLIBCXX_3.4.9 2025-05-07T20:11:08.1126928Z GLIBCXX_3.4.11 2025-05-07T20:11:08.1127832Z GLIBCXX_3.4.14 2025-05-07T20:11:08.1128417Z GLIBCXX_3.4.18 2025-05-07T20:11:08.1129012Z GLIBCXX_3.4.20 2025-05-07T20:11:08.1129594Z GLIBCXX_3.4.21 2025-05-07T20:11:08.1130305Z GLIBCXX_3.4.29 2025-05-07T20:11:08.1130804Z 2025-05-07T20:11:08.1130820Z 2025-05-07T20:11:08.1155953Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.XmlkOOP7zY.symbols.txt 2025-05-07T20:11:08.1157501Z 2025-05-07T20:11:08.1509758Z 2025-05-07T20:11:08.1535931Z [CHECK] Total Number of symbols: 4997 2025-05-07T20:11:08.1561713Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:11:08.1578913Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.YIXqC9QD9n.usymbols.txt 2025-05-07T20:11:08.1580442Z 2025-05-07T20:11:08.1612871Z 2025-05-07T20:11:08.1637849Z [CHECK] Listing out undefined symbols (258 total): 2025-05-07T20:11:08.1654589Z U GOMP_parallel 2025-05-07T20:11:08.1657122Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.1660537Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.1662460Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.1663507Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.1664674Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.1665218Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.1665595Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:08.1665947Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:08.1666295Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:08.1666641Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.1667051Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:08.1667378Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.1667674Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.1667990Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.1668291Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:08.1668621Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.1668910Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.1669344Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.1669839Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.1670221Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:08.1670556Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.1670877Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.1671300Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:08.1672191Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.1673491Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.1674898Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.1675961Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:08.1676713Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.1677531Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:08.1678106Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:08.1679182Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:08.1680377Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.1681045Z U at::detail::getCUDAHooks() 2025-05-07T20:11:08.1681363Z U at::detail::getHIPHooks() 2025-05-07T20:11:08.1681672Z U at::get_num_threads() 2025-05-07T20:11:08.1681960Z U at::get_thread_num() 2025-05-07T20:11:08.1682254Z U at::globalContext() 2025-05-07T20:11:08.1682542Z U at::in_parallel_region() 2025-05-07T20:11:08.1682849Z U at::init_num_threads() 2025-05-07T20:11:08.1683185Z U at::internal::set_thread_num(int) 2025-05-07T20:11:08.1683576Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:08.1684031Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.1684696Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.1685362Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:11:08.1686088Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:11:08.1686778Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:08.1687818Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.1689010Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.1689632Z U c10::Error::what() const 2025-05-07T20:11:08.1689962Z U c10::GradMode::is_enabled() 2025-05-07T20:11:08.1690288Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:08.1690688Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.1691253Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.1691816Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:08.1692193Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:11:08.1692541Z U c10::IValue::isTensorList() const 2025-05-07T20:11:08.1692911Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.1693244Z U c10::IntType::get() 2025-05-07T20:11:08.1693911Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.1694639Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.1695037Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.1695363Z U c10::NoneType::get() 2025-05-07T20:11:08.1695760Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.1696223Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:08.1696565Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:08.1696955Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:08.1697384Z U c10::StringType::get() 2025-05-07T20:11:08.1697766Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:08.1698456Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.1699290Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.1699672Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:08.1700050Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:08.1701112Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:08.1702146Z U c10::TensorType::get() 2025-05-07T20:11:08.1703429Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:08.1704812Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.1705817Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:08.1706902Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:11:08.1707401Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:08.1707785Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:08.1708142Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:08.1708504Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:08.1708889Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:08.1709355Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:08.1709840Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:08.1710437Z U c10::cuda::device_count() 2025-05-07T20:11:08.1710805Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:08.1711198Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:08.1711610Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:08.1712008Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:08.1712434Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:08.1712845Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:08.1713516Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.1714629Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.1716376Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.1717822Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.1718728Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.1719761Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.1720871Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.1721875Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:11:08.1722248Z U c10::get_default_dtype() 2025-05-07T20:11:08.1722706Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:08.1723278Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:08.1723694Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:08.1724026Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:08.1724368Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:08.1724714Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:08.1725302Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.1725952Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:11:08.1726365Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:11:08.1726842Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:11:08.1727321Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:08.1727706Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:08.1728107Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:11:08.1728478Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:08.1728908Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.1729318Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:08.1729670Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:08.1730022Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:08.1730378Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:08.1730724Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:08.1731054Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:08.1731383Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:08.1731714Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:08.1732062Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:08.1732389Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:08.1732726Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:08.1733055Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:08.1733387Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:08.1733786Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:08.1734754Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1736405Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1738094Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1739793Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1741497Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1743510Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1746146Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:08.1747856Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:08.1749823Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1751708Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:08.1753637Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1755548Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:08.1758526Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:08.1760588Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1762408Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:08.1764140Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:08.1765980Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1767891Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:08.1770009Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1771929Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:08.1773662Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:08.1775514Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1777305Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1779104Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1780951Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1782786Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1784773Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1786991Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:08.1788222Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.1788642Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.1789135Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.1789525Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.1790219Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:11:08.1790946Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.1791375Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.1791786Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.1792655Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:11:08.1793823Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.1794560Z U memchr@GLIBC_2.2.5 2025-05-07T20:11:08.1794854Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.1795152Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.1795486Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.1795781Z U omp_get_num_threads 2025-05-07T20:11:08.1796079Z U omp_get_thread_num 2025-05-07T20:11:08.1796421Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.1796829Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.1797278Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:08.1797970Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.1799036Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.1800005Z U std::__cxx11::basic_stringbuf, std::allocator >::_M_sync(char*, unsigned long, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:11:08.1800966Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.1801845Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:08.1802621Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.1803449Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:08.1804041Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:08.1804454Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:08.1804815Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.1805131Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:08.1805476Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:08.1805833Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.1806209Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.1806611Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.1806999Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:08.1807459Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.1808117Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.1809120Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.1810272Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.1810448Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:08.1810595Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:11:08.1810784Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:08.1810903Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:08.1811022Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:08.1811149Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.1811294Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.1811405Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.1811526Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.1811727Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:11:08.1811836Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.1811954Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.1812344Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:08.1812466Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:11:08.1812651Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.1812883Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.1813004Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.1813168Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:08.1813320Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:08.1813531Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:08.1813645Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.1813738Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:08.1813827Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.1813943Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.1814523Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.1814997Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.1815254Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.1816288Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:11:08.1816642Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:11:08.1816986Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.1817366Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:11:08.1817516Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:11:08.1817821Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:11:08.1818257Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:11:08.1818571Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:11:08.1818731Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:11:08.1819201Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:11:08.1819319Z U typeinfo for c10::Error 2025-05-07T20:11:08.1819463Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:08.1819584Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:11:08.1819744Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:08.1819921Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.1820121Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.1820279Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.1820435Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.1820580Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.1820686Z U vtable for c10::Error 2025-05-07T20:11:08.1821025Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.1821338Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.1821703Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.1821896Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.1822106Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.1822238Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:08.1822345Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.1822448Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.1822551Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.1824369Z w __gmon_start__ 2025-05-07T20:11:08.1824456Z w __pthread_key_create 2025-05-07T20:11:08.1824563Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:08.1824678Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:08.1824823Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.1825030Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.1825038Z 2025-05-07T20:11:08.1825153Z linux-vdso.so.1 (0x00007ffed51c1000) 2025-05-07T20:11:08.1825241Z libc10.so => not found 2025-05-07T20:11:08.1825330Z libc10_cuda.so => not found 2025-05-07T20:11:08.1825682Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f159a000000) 2025-05-07T20:11:08.1826123Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f1598800000) 2025-05-07T20:11:08.1826215Z libtorch.so => not found 2025-05-07T20:11:08.1826308Z libtorch_cpu.so => not found 2025-05-07T20:11:08.1826407Z libtorch_cuda.so => not found 2025-05-07T20:11:08.1826498Z libcudart.so.12 => not found 2025-05-07T20:11:08.1826651Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f159859c000) 2025-05-07T20:11:08.1826803Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f15b5a07000) 2025-05-07T20:11:08.1826919Z libc.so.6 => /lib64/libc.so.6 (0x00007f1598394000) 2025-05-07T20:11:08.1827039Z /lib64/ld-linux-x86-64.so.2 (0x00007f15b5a3b000) 2025-05-07T20:11:08.1827128Z libc10.so => not found 2025-05-07T20:11:08.1827467Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f159a585000) 2025-05-07T20:11:08.1827556Z libtorch.so => not found 2025-05-07T20:11:08.1827647Z libtorch_cpu.so => not found 2025-05-07T20:11:08.1827745Z libtorch_cuda.so => not found 2025-05-07T20:11:08.1827861Z libm.so.6 => /lib64/libm.so.6 (0x00007f1599f25000) 2025-05-07T20:11:08.1827951Z libtorch.so => not found 2025-05-07T20:11:08.1828040Z libc10.so => not found 2025-05-07T20:11:08.1828159Z libc10_cuda.so => not found 2025-05-07T20:11:08.1828248Z libtorch_cpu.so => not found 2025-05-07T20:11:08.1828334Z libtorch_cuda.so => not found 2025-05-07T20:11:08.1828431Z libcudart.so.12 => not found 2025-05-07T20:11:08.1828548Z libtorch_cpu.so => not found 2025-05-07T20:11:08.1828640Z libtorch_cuda.so => not found 2025-05-07T20:11:08.1828734Z libtorch.so => not found 2025-05-07T20:11:08.1828738Z 2025-05-07T20:11:08.1828838Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.1829158Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:08.1829164Z 2025-05-07T20:11:08.1829204Z 2025-05-07T20:11:08.1829543Z Dynamic section at offset 0x1af13978 contains 40 entries: 2025-05-07T20:11:08.1829663Z Tag Type Name/Value 2025-05-07T20:11:08.1829859Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.1830078Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:08.1830272Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:08.1830501Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:08.1830778Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.1830980Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.1831185Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.1831405Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:08.1831608Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.1831806Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.1831999Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.1832272Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.1832508Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:11:08.1832696Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:08.1832842Z 0x000000000000000c (INIT) 0x19a000 2025-05-07T20:11:08.1832961Z 0x000000000000000d (FINI) 0x7e3f4c 2025-05-07T20:11:08.1833092Z 0x0000000000000019 (INIT_ARRAY) 0x1af13d58 2025-05-07T20:11:08.1833243Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:11:08.1833375Z 0x000000000000001a (FINI_ARRAY) 0x1af13ee0 2025-05-07T20:11:08.1833502Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.1833612Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:08.1833751Z 0x000000006ffffef5 (GNU_HASH) 0x7048 2025-05-07T20:11:08.1833869Z 0x0000000000000005 (STRTAB) 0x2bee8 2025-05-07T20:11:08.1833988Z 0x0000000000000006 (SYMTAB) 0xea58 2025-05-07T20:11:08.1834154Z 0x000000000000000a (STRSZ) 1363139 (bytes) 2025-05-07T20:11:08.1834280Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.1834405Z 0x0000000000000003 (PLTGOT) 0x1af14c38 2025-05-07T20:11:08.1834570Z 0x0000000000000002 (PLTRELSZ) 15648 (bytes) 2025-05-07T20:11:08.1834691Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.1834811Z 0x0000000000000017 (JMPREL) 0x195ff8 2025-05-07T20:11:08.1834930Z 0x0000000000000007 (RELA) 0x17b418 2025-05-07T20:11:08.1835089Z 0x0000000000000008 (RELASZ) 109536 (bytes) 2025-05-07T20:11:08.1835213Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.1835319Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.1835464Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.1835587Z 0x000000006ffffffe (VERNEED) 0x17b2b8 2025-05-07T20:11:08.1835698Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:08.1835849Z 0x000000006ffffff0 (VERSYM) 0x178bac 2025-05-07T20:11:08.1835975Z 0x000000006ffffff9 (RELACOUNT) 79 2025-05-07T20:11:08.1836081Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.1836113Z 2025-05-07T20:11:08.1836236Z ################################################################################ 2025-05-07T20:11:08.1836241Z 2025-05-07T20:11:08.1836258Z 2025-05-07T20:11:08.1836373Z ################################################################################ 2025-05-07T20:11:08.1836743Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.1836850Z [CHECK] Listing out library size: 2025-05-07T20:11:08.1837217Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.1837224Z 2025-05-07T20:11:08.1837505Z 4 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.1837509Z 2025-05-07T20:11:08.1837992Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.1838623Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.1838629Z 2025-05-07T20:11:08.1981772Z GLIBC_2.2.5 2025-05-07T20:11:08.1982260Z GLIBC_2.3 2025-05-07T20:11:08.1982676Z GLIBC_2.14 2025-05-07T20:11:08.1982705Z 2025-05-07T20:11:08.1982728Z 2025-05-07T20:11:08.1985506Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.1986164Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.1986331Z 2025-05-07T20:11:08.2198161Z GLIBCXX_3.4 2025-05-07T20:11:08.2198614Z GLIBCXX_3.4.9 2025-05-07T20:11:08.2199097Z GLIBCXX_3.4.11 2025-05-07T20:11:08.2199547Z GLIBCXX_3.4.15 2025-05-07T20:11:08.2199983Z GLIBCXX_3.4.18 2025-05-07T20:11:08.2200427Z GLIBCXX_3.4.20 2025-05-07T20:11:08.2200840Z GLIBCXX_3.4.21 2025-05-07T20:11:08.2201098Z GLIBCXX_3.4.29 2025-05-07T20:11:08.2201116Z 2025-05-07T20:11:08.2201129Z 2025-05-07T20:11:08.2222883Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.ZrmLYjOiMa.symbols.txt 2025-05-07T20:11:08.2222959Z 2025-05-07T20:11:08.2401374Z 2025-05-07T20:11:08.2430044Z [CHECK] Total Number of symbols: 2654 2025-05-07T20:11:08.2448824Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:11:08.2467093Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.DaZu4UezJj.usymbols.txt 2025-05-07T20:11:08.2467163Z 2025-05-07T20:11:08.2487112Z 2025-05-07T20:11:08.2511853Z [CHECK] Listing out undefined symbols (194 total): 2025-05-07T20:11:08.2531252Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.2531534Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.2531742Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:08.2531943Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.2532166Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.2532394Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.2532553Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:08.2532666Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.2532780Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.2532908Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.2533016Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.2533121Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:08.2533447Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.2533562Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.2533739Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:08.2533958Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:08.2534097Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:08.2534216Z U at::RecordFunction::end() 2025-05-07T20:11:08.2534350Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:08.2534526Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:08.2535312Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.2535668Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:08.2536280Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.2537004Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.2537195Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:08.2537390Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:08.2537591Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.2537800Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:08.2538085Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.2538217Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:08.2538395Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:08.2538527Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:08.2538630Z U c10::AnyType::get() 2025-05-07T20:11:08.2538749Z U c10::BoolType::get() 2025-05-07T20:11:08.2538936Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:08.2539058Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:08.2539606Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:08.2540254Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:08.2540637Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.2540764Z U c10::Error::what() const 2025-05-07T20:11:08.2540867Z U c10::FloatType::get() 2025-05-07T20:11:08.2540975Z U c10::GradMode::is_enabled() 2025-05-07T20:11:08.2541107Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:08.2541270Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:08.2541387Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:08.2541514Z U c10::IValue::isBoolList() const 2025-05-07T20:11:08.2541625Z U c10::IValue::isIntList() const 2025-05-07T20:11:08.2541744Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:08.2541910Z U c10::IValue::isTensorList() const 2025-05-07T20:11:08.2542057Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.2542191Z U c10::IntType::get() 2025-05-07T20:11:08.2542699Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.2542873Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.2542998Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.2543145Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.2543273Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.2543500Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.2543808Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:08.2543912Z U c10::StringType::get() 2025-05-07T20:11:08.2544063Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:08.2544243Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:08.2544430Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:08.2544581Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:08.2544739Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:08.2545170Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.2545313Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.2545475Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:08.2545633Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:08.2545773Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:08.2545900Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:08.2546051Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:08.2546184Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:08.2546315Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:08.2546443Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:08.2546548Z U c10::SymIntType::get() 2025-05-07T20:11:08.2546672Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:08.2546795Z U c10::TensorType::get() 2025-05-07T20:11:08.2546921Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.2547363Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.2547910Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.2548174Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.2548680Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.2549169Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.2549964Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.2550361Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:08.2550556Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:08.2550714Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.2550893Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:08.2551279Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.2551412Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:08.2551609Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:08.2551774Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:08.2551929Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:08.2552145Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.2552293Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:08.2552598Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:08.2552701Z U free@GLIBC_2.2.5 2025-05-07T20:11:08.2552903Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.2553007Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:08.2553107Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.2553224Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.2553324Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.2553483Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.2553626Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.2553809Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:08.2554031Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:08.2554400Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.2554815Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.2555169Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.2555526Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:08.2556021Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.2556382Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:08.2556513Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.2556624Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:08.2556768Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.2556918Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.2557083Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.2557212Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:08.2557367Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:08.2557601Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.2557936Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.2558536Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.2559052Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.2559179Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:08.2559314Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.2559430Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.2559547Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.2559679Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.2559790Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.2559901Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.2560095Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.2560331Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.2560477Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.2560654Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:08.2560785Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:08.2561199Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:08.2561355Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:08.2561465Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.2561590Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:08.2561702Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.2561829Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.2562409Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.2562878Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.2563134Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.2563256Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:08.2563565Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:08.2563747Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:08.2563945Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:08.2564146Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:08.2564485Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:08.2564637Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:08.2564839Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:08.2565015Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:08.2565133Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:08.2565260Z U torch::autograd::Node::metadata() 2025-05-07T20:11:08.2565398Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:08.2565634Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:08.2565944Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:08.2566105Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:08.2566309Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:08.2566532Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:08.2569149Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:08.2569340Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:08.2569485Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:08.2569655Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:08.2570417Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:08.2570600Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:08.2571008Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:08.2571361Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.2571462Z U typeinfo for c10::Error 2025-05-07T20:11:08.2571611Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.2571733Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:08.2571880Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:08.2572020Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:08.2572134Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:08.2572283Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.2572455Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.2572606Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:08.2572764Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.2572880Z U vtable for c10::Error 2025-05-07T20:11:08.2573217Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.2573535Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.2573690Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.2573887Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.2574108Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.2574271Z U vtable for torch::autograd::Node 2025-05-07T20:11:08.2574445Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.2574555Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.2574705Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.2574807Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.2574905Z w __gmon_start__ 2025-05-07T20:11:08.2575000Z w __pthread_key_create 2025-05-07T20:11:08.2575117Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:08.2575226Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:08.2575366Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.2575849Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.2575859Z 2025-05-07T20:11:08.2577885Z linux-vdso.so.1 (0x00007ffc78bd2000) 2025-05-07T20:11:08.2578156Z libc10.so => not found 2025-05-07T20:11:08.2578646Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007fb552339000) 2025-05-07T20:11:08.2579569Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fb550c00000) 2025-05-07T20:11:08.2579746Z libtorch.so => not found 2025-05-07T20:11:08.2579942Z libtorch_cpu.so => not found 2025-05-07T20:11:08.2580140Z libtorch_cuda.so => not found 2025-05-07T20:11:08.2580470Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fb55099c000) 2025-05-07T20:11:08.2580723Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb552309000) 2025-05-07T20:11:08.2580852Z libc.so.6 => /lib64/libc.so.6 (0x00007fb550794000) 2025-05-07T20:11:08.2580985Z /lib64/ld-linux-x86-64.so.2 (0x00007fb552349000) 2025-05-07T20:11:08.2581080Z libc10.so => not found 2025-05-07T20:11:08.2581237Z libtorch_cpu.so => not found 2025-05-07T20:11:08.2581339Z libtorch_cuda.so => not found 2025-05-07T20:11:08.2581439Z libtorch.so => not found 2025-05-07T20:11:08.2581552Z libtorch.so => not found 2025-05-07T20:11:08.2581643Z libc10.so => not found 2025-05-07T20:11:08.2581744Z libc10_cuda.so => not found 2025-05-07T20:11:08.2581844Z libtorch_cpu.so => not found 2025-05-07T20:11:08.2581961Z libtorch_cuda.so => not found 2025-05-07T20:11:08.2582061Z libcudart.so.12 => not found 2025-05-07T20:11:08.2582188Z libm.so.6 => /lib64/libm.so.6 (0x00007fb55222a000) 2025-05-07T20:11:08.2582194Z 2025-05-07T20:11:08.2582318Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.2582645Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:08.2582651Z 2025-05-07T20:11:08.2617407Z 2025-05-07T20:11:08.2617959Z Dynamic section at offset 0x39abb0 contains 38 entries: 2025-05-07T20:11:08.2618205Z Tag Type Name/Value 2025-05-07T20:11:08.2618600Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.2618864Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:08.2619094Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:08.2619306Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.2619534Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.2619742Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.2619949Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.2620161Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.2620379Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.2620604Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.2621114Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:11:08.2621309Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:08.2621436Z 0x000000000000000c (INIT) 0xb9000 2025-05-07T20:11:08.2621628Z 0x000000000000000d (FINI) 0x33effc 2025-05-07T20:11:08.2621745Z 0x0000000000000019 (INIT_ARRAY) 0x397b28 2025-05-07T20:11:08.2621876Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:11:08.2621997Z 0x000000000000001a (FINI_ARRAY) 0x397c58 2025-05-07T20:11:08.2622132Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.2622241Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:08.2622362Z 0x000000006ffffef5 (GNU_HASH) 0x3b08 2025-05-07T20:11:08.2622488Z 0x0000000000000005 (STRTAB) 0x17258 2025-05-07T20:11:08.2622598Z 0x0000000000000006 (SYMTAB) 0x7970 2025-05-07T20:11:08.2622741Z 0x000000000000000a (STRSZ) 529940 (bytes) 2025-05-07T20:11:08.2622879Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.2622998Z 0x0000000000000003 (PLTGOT) 0x39ae50 2025-05-07T20:11:08.2623185Z 0x0000000000000002 (PLTRELSZ) 14112 (bytes) 2025-05-07T20:11:08.2623296Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.2623421Z 0x0000000000000017 (JMPREL) 0xb52c8 2025-05-07T20:11:08.2623531Z 0x0000000000000007 (RELA) 0x99e60 2025-05-07T20:11:08.2623669Z 0x0000000000000008 (RELASZ) 111720 (bytes) 2025-05-07T20:11:08.2623809Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.2623909Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.2624037Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.2624175Z 0x000000006ffffffe (VERNEED) 0x99d30 2025-05-07T20:11:08.2624328Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:08.2624446Z 0x000000006ffffff0 (VERSYM) 0x9886c 2025-05-07T20:11:08.2624560Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:11:08.2624674Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.2624682Z 2025-05-07T20:11:08.2624803Z ################################################################################ 2025-05-07T20:11:08.2624808Z 2025-05-07T20:11:08.2624812Z 2025-05-07T20:11:08.2624924Z ################################################################################ 2025-05-07T20:11:08.2625260Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.2625366Z [CHECK] Listing out library size: 2025-05-07T20:11:08.2625680Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.2625684Z 2025-05-07T20:11:08.2631109Z 343 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.2631470Z 2025-05-07T20:11:08.2632318Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.2632879Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.2632902Z 2025-05-07T20:11:08.3611616Z GLIBC_2.2.5 2025-05-07T20:11:08.3612597Z GLIBC_2.3 2025-05-07T20:11:08.3612857Z GLIBC_2.14 2025-05-07T20:11:08.3612901Z 2025-05-07T20:11:08.3612915Z 2025-05-07T20:11:08.3614323Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.3614923Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.3614929Z 2025-05-07T20:11:08.4601545Z GLIBCXX_3.4 2025-05-07T20:11:08.4602389Z GLIBCXX_3.4.9 2025-05-07T20:11:08.4603047Z GLIBCXX_3.4.20 2025-05-07T20:11:08.4603601Z GLIBCXX_3.4.21 2025-05-07T20:11:08.4604020Z GLIBCXX_3.4.29 2025-05-07T20:11:08.4604158Z 2025-05-07T20:11:08.4604162Z 2025-05-07T20:11:08.4624871Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.kmbgd14ugj.symbols.txt 2025-05-07T20:11:08.4625469Z 2025-05-07T20:11:08.5587667Z 2025-05-07T20:11:08.5630326Z [CHECK] Total Number of symbols: 12731 2025-05-07T20:11:08.5675682Z [CHECK] Number of fbgemm symbols: 5268 2025-05-07T20:11:08.5692663Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.3VKzGQYqW6.usymbols.txt 2025-05-07T20:11:08.5693245Z 2025-05-07T20:11:08.5746999Z 2025-05-07T20:11:08.5771520Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:08.5787412Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.5789340Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.5790575Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.5791751Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.5793208Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.5794481Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:08.5794881Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:08.5795247Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:08.5795625Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.5795994Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:08.5796344Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.5796678Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.5796999Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.5797404Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:08.5797729Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.5798082Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.5798409Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.5798747Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.5799070Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:08.5799385Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.5799832Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.5800211Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:08.5800633Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:08.5801325Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:08.5802017Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:08.5802639Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:08.5803217Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:08.5804235Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.5805174Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:08.5805644Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.5806120Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:08.5806546Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:08.5806988Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.5807539Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.5807942Z U c10::BoolType::get() 2025-05-07T20:11:08.5808352Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:08.5808785Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:08.5809193Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:08.5809930Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:08.5811149Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:08.5812245Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.5812828Z U c10::Error::what() const 2025-05-07T20:11:08.5813123Z U c10::FloatType::get() 2025-05-07T20:11:08.5813505Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.5813922Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.5814346Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.5814682Z U c10::IntType::get() 2025-05-07T20:11:08.5815038Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.5815438Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.5815778Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.5816140Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.5816593Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:08.5816990Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:08.5817388Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:08.5818030Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.5818673Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.5819026Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:11:08.5819403Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:08.5819770Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:08.5820110Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:08.5820485Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:11:08.5820837Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:08.5821208Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:08.5821551Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:08.5821904Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:08.5822225Z U c10::SymIntType::get() 2025-05-07T20:11:08.5822510Z U c10::TensorType::get() 2025-05-07T20:11:08.5822823Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.5823727Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:08.5824664Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:08.5825018Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:08.5825347Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:08.5825717Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:08.5826040Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:08.5826381Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:08.5826872Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:08.5827319Z U c10::cuda::device_count() 2025-05-07T20:11:08.5827664Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:08.5828031Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:08.5828412Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:08.5828778Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:08.5829274Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:08.5829860Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:08.5830695Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.5831666Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.5832591Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.5833603Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.5834709Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.5835569Z U c10::get_default_dtype() 2025-05-07T20:11:08.5835924Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:08.5836274Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:08.5836863Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:08.5837530Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:08.5837975Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:08.5838358Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.5838758Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:08.5839183Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:11:08.5839554Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:11:08.5839944Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:08.5840551Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:08.5840914Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:08.5841304Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:08.5841701Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:08.5842131Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:08.5842499Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:08.5842891Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.5843344Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:08.5843700Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:08.5844058Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:08.5844396Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:08.5844738Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:08.5845073Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:08.5845435Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:08.5845794Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:08.5846157Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:08.5846499Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:08.5846824Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:08.5847178Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:08.5847519Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:08.5848034Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.5848549Z U float at::Tensor::item() const 2025-05-07T20:11:08.5848897Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.5849314Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.5849675Z U free@GLIBC_2.2.5 2025-05-07T20:11:08.5849978Z U int at::Tensor::item() const 2025-05-07T20:11:08.5850355Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.5850721Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.5851146Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.5851549Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.5851937Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.5852284Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.5852588Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.5852897Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.5853224Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.5853641Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.5854204Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.5855026Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.5855849Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.5856392Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.5856755Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.5857132Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.5857543Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.5858064Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.5858736Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.5859748Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.5860908Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.5861626Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:08.5861984Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.5862316Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.5862669Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.5863009Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.5863361Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.5863687Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.5864104Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.5864632Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.5865106Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.5865437Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.5865747Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.5866047Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.5866853Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.5867993Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.5868827Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.5869852Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.5870451Z U typeinfo for c10::Error 2025-05-07T20:11:08.5870837Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.5871291Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.5871735Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.5872128Z U vtable for c10::Error 2025-05-07T20:11:08.5872738Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.5873586Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.5874279Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.5874836Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.5875393Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.5875811Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.5876155Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.5876493Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.5876801Z w __gmon_start__ 2025-05-07T20:11:08.5877095Z w __pthread_key_create 2025-05-07T20:11:08.5877455Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.5877977Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.5878340Z 2025-05-07T20:11:08.5878495Z linux-vdso.so.1 (0x00007ffea0711000) 2025-05-07T20:11:08.5878797Z libc10.so => not found 2025-05-07T20:11:08.5879077Z libc10_cuda.so => not found 2025-05-07T20:11:08.5879750Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007faa8f400000) 2025-05-07T20:11:08.5880476Z libtorch.so => not found 2025-05-07T20:11:08.5880744Z libtorch_cpu.so => not found 2025-05-07T20:11:08.5881043Z libtorch_cuda.so => not found 2025-05-07T20:11:08.5881341Z libcudart.so.12 => not found 2025-05-07T20:11:08.5881683Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007faa8f19c000) 2025-05-07T20:11:08.5882225Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007faaa581b000) 2025-05-07T20:11:08.5882601Z libc.so.6 => /lib64/libc.so.6 (0x00007faa8ef94000) 2025-05-07T20:11:08.5882977Z /lib64/ld-linux-x86-64.so.2 (0x00007faaa584f000) 2025-05-07T20:11:08.5883323Z libc10.so => not found 2025-05-07T20:11:08.5883586Z libc10_cuda.so => not found 2025-05-07T20:11:08.5884129Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007faa8ea00000) 2025-05-07T20:11:08.5885436Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007faaa580d000) 2025-05-07T20:11:08.5886135Z libtorch.so => not found 2025-05-07T20:11:08.5886407Z libtorch_cpu.so => not found 2025-05-07T20:11:08.5886708Z libtorch_cuda.so => not found 2025-05-07T20:11:08.5906403Z libcudart.so.12 => not found 2025-05-07T20:11:08.5906735Z libm.so.6 => /lib64/libm.so.6 (0x00007faaa5730000) 2025-05-07T20:11:08.5907212Z libc10.so => not found 2025-05-07T20:11:08.5907958Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007faa8f785000) 2025-05-07T20:11:08.5908570Z libtorch.so => not found 2025-05-07T20:11:08.5908869Z libtorch_cpu.so => not found 2025-05-07T20:11:08.5909282Z libtorch_cuda.so => not found 2025-05-07T20:11:08.5909659Z libc10.so => not found 2025-05-07T20:11:08.5910122Z libtorch_cpu.so => not found 2025-05-07T20:11:08.5910416Z libtorch_cuda.so => not found 2025-05-07T20:11:08.5910688Z libtorch.so => not found 2025-05-07T20:11:08.5910952Z libtorch_cpu.so => not found 2025-05-07T20:11:08.5911229Z libtorch_cuda.so => not found 2025-05-07T20:11:08.5911503Z libtorch.so => not found 2025-05-07T20:11:08.5911667Z 2025-05-07T20:11:08.5911790Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.5912270Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:08.5912664Z 2025-05-07T20:11:08.5912714Z 2025-05-07T20:11:08.5912886Z Dynamic section at offset 0x1569a110 contains 39 entries: 2025-05-07T20:11:08.5913292Z Tag Type Name/Value 2025-05-07T20:11:08.5913768Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.5914299Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:08.5914843Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:08.5915408Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.5915929Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.5916477Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.5917022Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:08.5917553Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.5918079Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.5918588Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.5919131Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.5919737Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:11:08.5920315Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:08.5920740Z 0x000000000000000c (INIT) 0x44b000 2025-05-07T20:11:08.5921086Z 0x000000000000000d (FINI) 0x22530cc 2025-05-07T20:11:08.5921446Z 0x0000000000000019 (INIT_ARRAY) 0x15698508 2025-05-07T20:11:08.5921924Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:11:08.5922292Z 0x000000000000001a (FINI_ARRAY) 0x156987f8 2025-05-07T20:11:08.5922637Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.5922988Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:08.5923325Z 0x000000006ffffef5 (GNU_HASH) 0x10898 2025-05-07T20:11:08.5923659Z 0x0000000000000005 (STRTAB) 0x6f610 2025-05-07T20:11:08.5923994Z 0x0000000000000006 (SYMTAB) 0x24c70 2025-05-07T20:11:08.5924403Z 0x000000000000000a (STRSZ) 3691715 (bytes) 2025-05-07T20:11:08.5924774Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.5925166Z 0x0000000000000003 (PLTGOT) 0x1569a3c0 2025-05-07T20:11:08.5925537Z 0x0000000000000002 (PLTRELSZ) 10920 (bytes) 2025-05-07T20:11:08.5925877Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.5926213Z 0x0000000000000017 (JMPREL) 0x4484b0 2025-05-07T20:11:08.5926552Z 0x0000000000000007 (RELA) 0x3faf60 2025-05-07T20:11:08.5926897Z 0x0000000000000008 (RELASZ) 316752 (bytes) 2025-05-07T20:11:08.5927271Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.5927594Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.5927928Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.5928281Z 0x000000006ffffffe (VERNEED) 0x3fae50 2025-05-07T20:11:08.5928623Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:08.5928949Z 0x000000006ffffff0 (VERSYM) 0x3f4ad4 2025-05-07T20:11:08.5929441Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:11:08.5929789Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.5929988Z 2025-05-07T20:11:08.5930098Z ################################################################################ 2025-05-07T20:11:08.5930335Z 2025-05-07T20:11:08.5930339Z 2025-05-07T20:11:08.5930449Z ################################################################################ 2025-05-07T20:11:08.5930981Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.5931517Z [CHECK] Listing out library size: 2025-05-07T20:11:08.5932016Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.5932455Z 2025-05-07T20:11:08.5932690Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.5933043Z 2025-05-07T20:11:08.5933469Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.5934534Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.5935183Z 2025-05-07T20:11:08.5963470Z GLIBC_2.2.5 2025-05-07T20:11:08.5963958Z GLIBC_2.3 2025-05-07T20:11:08.5964167Z GLIBC_2.14 2025-05-07T20:11:08.5964298Z 2025-05-07T20:11:08.5964303Z 2025-05-07T20:11:08.5964772Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.5965928Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.5966618Z 2025-05-07T20:11:08.6022759Z GLIBCXX_3.4 2025-05-07T20:11:08.6023535Z GLIBCXX_3.4.9 2025-05-07T20:11:08.6023746Z GLIBCXX_3.4.18 2025-05-07T20:11:08.6023971Z GLIBCXX_3.4.20 2025-05-07T20:11:08.6024181Z GLIBCXX_3.4.21 2025-05-07T20:11:08.6024401Z GLIBCXX_3.4.29 2025-05-07T20:11:08.6024613Z 2025-05-07T20:11:08.6024617Z 2025-05-07T20:11:08.6044632Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.teuFSJv7lS.symbols.txt 2025-05-07T20:11:08.6045202Z 2025-05-07T20:11:08.6069534Z 2025-05-07T20:11:08.6093174Z [CHECK] Total Number of symbols: 356 2025-05-07T20:11:08.6109342Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:08.6124330Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.N0GUF3YZkK.usymbols.txt 2025-05-07T20:11:08.6124866Z 2025-05-07T20:11:08.6148657Z 2025-05-07T20:11:08.6175814Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:11:08.6195113Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6195956Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6196621Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.6196988Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.6197390Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.6197796Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.6198192Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:08.6198576Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:08.6198944Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:08.6199310Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.6199677Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.6200041Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.6200363Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.6200690Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.6201063Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.6201399Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.6201714Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.6202043Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:08.6202389Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.6202710Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.6203555Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.6205009Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.6205978Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:08.6206423Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.6206778Z U c10::IntType::get() 2025-05-07T20:11:08.6207174Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.6207599Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.6208057Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.6208954Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.6209725Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.6210074Z U c10::TensorType::get() 2025-05-07T20:11:08.6210399Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.6211310Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:08.6212243Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:08.6212587Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:08.6212931Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:08.6213261Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:08.6213577Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:08.6213911Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:08.6214356Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:08.6214841Z U c10::cuda::device_count() 2025-05-07T20:11:08.6215159Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:08.6215536Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:08.6215941Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:08.6216309Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:08.6216702Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:08.6217065Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:08.6217781Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.6218631Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.6219465Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.6220380Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.6220965Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:08.6221278Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:08.6221606Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:08.6221959Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:08.6222339Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:08.6222684Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:08.6223073Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.6223529Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:08.6223884Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:08.6224234Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:08.6224564Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:08.6224902Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:08.6225219Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:08.6225555Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:08.6225923Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:08.6226270Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:08.6226605Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:08.6226921Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:08.6227254Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:08.6227586Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:08.6227930Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:08.6228305Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6228710Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.6229239Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6229770Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.6230078Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.6230421Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.6230777Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.6231175Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.6231776Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.6232651Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.6233648Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.6234543Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:08.6235155Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.6235511Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:08.6235885Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.6236308Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.6236762Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.6237302Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.6238238Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.6239336Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.6240614Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.6241396Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.6241766Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.6242121Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.6242480Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.6242838Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.6243189Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.6243597Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.6244164Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.6244670Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.6245023Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.6245355Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.6245671Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.6246531Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.6247752Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.6248612Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.6249388Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.6250113Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6250608Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.6251048Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.6251486Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.6252141Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6253003Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6254060Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6256458Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.6256999Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.6257423Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.6257739Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.6258031Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.6258323Z w __gmon_start__ 2025-05-07T20:11:08.6258579Z w __pthread_key_create 2025-05-07T20:11:08.6258909Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.6259391Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.6259741Z 2025-05-07T20:11:08.6259871Z linux-vdso.so.1 (0x00007ffcbdec1000) 2025-05-07T20:11:08.6260164Z libtorch.so => not found 2025-05-07T20:11:08.6260397Z libc10.so => not found 2025-05-07T20:11:08.6260669Z libc10_cuda.so => not found 2025-05-07T20:11:08.6260918Z libtorch_cpu.so => not found 2025-05-07T20:11:08.6261185Z libtorch_cuda.so => not found 2025-05-07T20:11:08.6261438Z libcudart.so.12 => not found 2025-05-07T20:11:08.6261756Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc8e6a03000) 2025-05-07T20:11:08.6262173Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc8e69d5000) 2025-05-07T20:11:08.6262535Z libc.so.6 => /lib64/libc.so.6 (0x00007fc8e67cd000) 2025-05-07T20:11:08.6262884Z /lib64/ld-linux-x86-64.so.2 (0x00007fc8e6cda000) 2025-05-07T20:11:08.6263219Z libm.so.6 => /lib64/libm.so.6 (0x00007fc8e66f2000) 2025-05-07T20:11:08.6263454Z 2025-05-07T20:11:08.6263556Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.6264023Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:08.6264388Z 2025-05-07T20:11:08.6272030Z 2025-05-07T20:11:08.6272593Z Dynamic section at offset 0x6a540 contains 37 entries: 2025-05-07T20:11:08.6273704Z Tag Type Name/Value 2025-05-07T20:11:08.6274516Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.6275059Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.6275577Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:08.6276095Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.6276631Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.6277159Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:08.6277693Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.6278212Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.6278720Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.6279255Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.6279854Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:11:08.6280354Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:11:08.6280687Z 0x000000000000000d (FINI) 0x2c63c 2025-05-07T20:11:08.6281031Z 0x0000000000000019 (INIT_ARRAY) 0x6b1f8 2025-05-07T20:11:08.6281375Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:11:08.6281737Z 0x000000000000001a (FINI_ARRAY) 0x6b220 2025-05-07T20:11:08.6282094Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.6282433Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:08.6282775Z 0x000000006ffffef5 (GNU_HASH) 0x12b0 2025-05-07T20:11:08.6283155Z 0x0000000000000005 (STRTAB) 0x3ff0 2025-05-07T20:11:08.6283484Z 0x0000000000000006 (SYMTAB) 0x1e78 2025-05-07T20:11:08.6283829Z 0x000000000000000a (STRSZ) 31425 (bytes) 2025-05-07T20:11:08.6284232Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.6284764Z 0x0000000000000003 (PLTGOT) 0x6b7e0 2025-05-07T20:11:08.6285201Z 0x0000000000000002 (PLTRELSZ) 4320 (bytes) 2025-05-07T20:11:08.6285553Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.6285871Z 0x0000000000000017 (JMPREL) 0xd0f8 2025-05-07T20:11:08.6286204Z 0x0000000000000007 (RELA) 0xbeb0 2025-05-07T20:11:08.6286543Z 0x0000000000000008 (RELASZ) 4680 (bytes) 2025-05-07T20:11:08.6286902Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.6287223Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.6287559Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.6287906Z 0x000000006ffffffe (VERNEED) 0xbd80 2025-05-07T20:11:08.6288244Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:08.6288575Z 0x000000006ffffff0 (VERSYM) 0xbab2 2025-05-07T20:11:08.6288980Z 0x000000006ffffff9 (RELACOUNT) 24 2025-05-07T20:11:08.6289299Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.6289502Z 2025-05-07T20:11:08.6289617Z ################################################################################ 2025-05-07T20:11:08.6289858Z 2025-05-07T20:11:08.6289862Z 2025-05-07T20:11:08.6289973Z ################################################################################ 2025-05-07T20:11:08.6290504Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6291015Z [CHECK] Listing out library size: 2025-05-07T20:11:08.6291503Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6291944Z 2025-05-07T20:11:08.6292175Z 35 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6292516Z 2025-05-07T20:11:08.6292929Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6293984Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.6294608Z 2025-05-07T20:11:08.6396230Z GLIBC_2.2.5 2025-05-07T20:11:08.6396897Z GLIBC_2.3 2025-05-07T20:11:08.6397456Z GLIBC_2.14 2025-05-07T20:11:08.6397802Z 2025-05-07T20:11:08.6397817Z 2025-05-07T20:11:08.6399114Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6402027Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.6402686Z 2025-05-07T20:11:08.6509176Z GLIBCXX_3.4 2025-05-07T20:11:08.6509902Z GLIBCXX_3.4.9 2025-05-07T20:11:08.6510496Z GLIBCXX_3.4.11 2025-05-07T20:11:08.6511111Z GLIBCXX_3.4.15 2025-05-07T20:11:08.6511731Z GLIBCXX_3.4.18 2025-05-07T20:11:08.6512308Z GLIBCXX_3.4.20 2025-05-07T20:11:08.6512877Z GLIBCXX_3.4.21 2025-05-07T20:11:08.6513081Z GLIBCXX_3.4.29 2025-05-07T20:11:08.6513207Z 2025-05-07T20:11:08.6513224Z 2025-05-07T20:11:08.6529931Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.XWghfXbddg.symbols.txt 2025-05-07T20:11:08.6531473Z 2025-05-07T20:11:08.6606462Z 2025-05-07T20:11:08.6631308Z [CHECK] Total Number of symbols: 1477 2025-05-07T20:11:08.6648261Z [CHECK] Number of fbgemm symbols: 213 2025-05-07T20:11:08.6664645Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.71mjKFh9Vm.usymbols.txt 2025-05-07T20:11:08.6665292Z 2025-05-07T20:11:08.6686109Z 2025-05-07T20:11:08.6719153Z [CHECK] Listing out undefined symbols (270 total): 2025-05-07T20:11:08.6744541Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6745556Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6746128Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.6746515Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.6746918Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.6747317Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.6747697Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:08.6748084Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:08.6748206Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:08.6748356Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.6748476Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:08.6748579Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.6748766Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.6748869Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.6749113Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:08.6749221Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.6749339Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.6749446Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.6749548Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.6749670Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:08.6749770Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:08.6749919Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.6750091Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.6750206Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:08.6750356Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:08.6750539Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:08.6750679Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:08.6750806Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:08.6750958Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:08.6751162Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:08.6751274Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:08.6751395Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:08.6751557Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:11:08.6751721Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:08.6752336Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.6753029Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.6753204Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.6753377Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.6753580Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:08.6753750Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.6754116Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.6754327Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:08.6754480Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:08.6754665Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.6754872Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:11:08.6755049Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:08.6755424Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:08.6755719Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:08.6756334Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:08.6756545Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.6756697Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.6757153Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.6757720Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.6757845Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:08.6757993Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:08.6758158Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:08.6758256Z U at::globalContext() 2025-05-07T20:11:08.6758389Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:11:08.6758524Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:08.6758633Z U bool at::Tensor::item() const 2025-05-07T20:11:08.6758725Z U c10::AnyType::get() 2025-05-07T20:11:08.6758899Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:08.6759095Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6759193Z U c10::BoolType::get() 2025-05-07T20:11:08.6759355Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:08.6759528Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:08.6759639Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:08.6760145Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:08.6760758Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:08.6761120Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.6761229Z U c10::Error::what() const 2025-05-07T20:11:08.6761331Z U c10::GradMode::is_enabled() 2025-05-07T20:11:08.6761434Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:08.6761612Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6761793Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:08.6761908Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:08.6762027Z U c10::IValue::isBoolList() const 2025-05-07T20:11:08.6762159Z U c10::IValue::isIntList() const 2025-05-07T20:11:08.6762268Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:08.6762384Z U c10::IValue::isTensorList() const 2025-05-07T20:11:08.6762518Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.6762609Z U c10::IntType::get() 2025-05-07T20:11:08.6763071Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.6763256Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.6763374Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.6763492Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.6763621Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.6763917Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:08.6764072Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:08.6764167Z U c10::StringType::get() 2025-05-07T20:11:08.6764314Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:08.6764710Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.6764842Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.6764962Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:08.6765138Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:08.6765236Z U c10::SymIntType::get() 2025-05-07T20:11:08.6765395Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:08.6765513Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:08.6765942Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:08.6766104Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:08.6766201Z U c10::TensorType::get() 2025-05-07T20:11:08.6766385Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:08.6766501Z U c10::Type::is_module() const 2025-05-07T20:11:08.6766622Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.6767378Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:08.6767506Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:08.6767635Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:08.6767750Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:08.6767859Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:08.6767987Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:08.6768095Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:08.6768336Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:08.6768447Z U c10::cuda::device_count() 2025-05-07T20:11:08.6768577Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:08.6768708Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:08.6768866Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:08.6769012Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:08.6769184Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:08.6769293Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:08.6769720Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.6770221Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.6770475Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.6770956Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.6771283Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.6771882Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.6772146Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:08.6772333Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:08.6772459Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:08.6772564Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:08.6772868Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:08.6773084Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:08.6773228Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:08.6773384Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:08.6773507Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:08.6773622Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.6773764Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:08.6774136Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.6774277Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:08.6774400Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:08.6774571Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:08.6774687Z U c10::throwNullDataPtrError() 2025-05-07T20:11:08.6774801Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:08.6774921Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:08.6775027Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:08.6775209Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.6775325Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:08.6775455Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:08.6775599Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:08.6775727Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:08.6775832Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:08.6775948Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:08.6776066Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:08.6776198Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:08.6776315Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:08.6776444Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:08.6776600Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:08.6776714Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:08.6776833Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:08.6776941Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:08.6777047Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:08.6777164Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:08.6777298Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:08.6777487Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:08.6777639Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6777742Z U free@GLIBC_2.2.5 2025-05-07T20:11:08.6777883Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6777997Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:08.6778120Z U long at::Tensor::item() const 2025-05-07T20:11:08.6778291Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.6778417Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.6778570Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.6778666Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:08.6778758Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.6778845Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.6778943Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.6779116Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.6779232Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.6779333Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:08.6779538Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:08.6779869Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.6780257Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.6780581Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.6780937Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:08.6781062Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.6781170Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:08.6781306Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.6781448Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.6781617Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.6781744Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:08.6781887Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:08.6782116Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.6782450Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.6783050Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.6783572Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.6783702Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:08.6783828Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.6783939Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.6784046Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.6784169Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.6784271Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.6784588Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.6784960Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.6785208Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.6785334Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.6785720Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:08.6785858Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:08.6786305Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:08.6786461Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:08.6786569Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.6786667Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:08.6786770Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.6786938Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.6787567Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.6788065Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.6788333Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.6788459Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:08.6788774Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:08.6789099Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:08.6789314Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:08.6789523Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:08.6789885Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:08.6790064Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:08.6790268Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:08.6790450Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:08.6790577Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:08.6790709Z U torch::autograd::Node::metadata() 2025-05-07T20:11:08.6790845Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:08.6791094Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:08.6791385Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:08.6791574Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:08.6791792Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:08.6792067Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:08.6794884Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:08.6795101Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:08.6795257Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:08.6795433Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:08.6795589Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:08.6796014Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:08.6796402Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.6797006Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:08.6797115Z U typeinfo for c10::Error 2025-05-07T20:11:08.6797232Z U typeinfo for c10::Type 2025-05-07T20:11:08.6797376Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.6797510Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:08.6797659Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:08.6797781Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:08.6797935Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.6798119Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.6798279Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:08.6798440Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.6798602Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.6798714Z U vtable for c10::Error 2025-05-07T20:11:08.6798819Z U vtable for c10::ListType 2025-05-07T20:11:08.6799180Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6799532Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6799882Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.6800031Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.6800241Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.6800499Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.6800628Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:08.6800755Z U vtable for torch::autograd::Node 2025-05-07T20:11:08.6800968Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.6801080Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.6801197Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.6801301Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.6801405Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:08.6801505Z w __gmon_start__ 2025-05-07T20:11:08.6801599Z w __pthread_key_create 2025-05-07T20:11:08.6801708Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:08.6801824Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:08.6801984Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.6802211Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6802219Z 2025-05-07T20:11:08.6802362Z linux-vdso.so.1 (0x00007ffc62feb000) 2025-05-07T20:11:08.6802480Z libc10.so => not found 2025-05-07T20:11:08.6802578Z libc10_cuda.so => not found 2025-05-07T20:11:08.6803156Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f306fe50000) 2025-05-07T20:11:08.6803649Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f306ec00000) 2025-05-07T20:11:08.6803747Z libtorch.so => not found 2025-05-07T20:11:08.6803845Z libtorch_cpu.so => not found 2025-05-07T20:11:08.6803953Z libtorch_cuda.so => not found 2025-05-07T20:11:08.6804049Z libcudart.so.12 => not found 2025-05-07T20:11:08.6804241Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f306e99c000) 2025-05-07T20:11:08.6804380Z libm.so.6 => /lib64/libm.so.6 (0x00007f306fd75000) 2025-05-07T20:11:08.6804531Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f30723c6000) 2025-05-07T20:11:08.6804654Z libc.so.6 => /lib64/libc.so.6 (0x00007f306e794000) 2025-05-07T20:11:08.6804782Z /lib64/ld-linux-x86-64.so.2 (0x00007f30723fa000) 2025-05-07T20:11:08.6804886Z libc10.so => not found 2025-05-07T20:11:08.6804985Z libc10_cuda.so => not found 2025-05-07T20:11:08.6805077Z libtorch.so => not found 2025-05-07T20:11:08.6805189Z libtorch_cpu.so => not found 2025-05-07T20:11:08.6805282Z libtorch_cuda.so => not found 2025-05-07T20:11:08.6805376Z libcudart.so.12 => not found 2025-05-07T20:11:08.6805582Z libtorch.so => not found 2025-05-07T20:11:08.6805676Z libc10.so => not found 2025-05-07T20:11:08.6805767Z libc10_cuda.so => not found 2025-05-07T20:11:08.6805858Z libtorch_cpu.so => not found 2025-05-07T20:11:08.6805966Z libtorch_cuda.so => not found 2025-05-07T20:11:08.6806061Z libcudart.so.12 => not found 2025-05-07T20:11:08.6806065Z 2025-05-07T20:11:08.6806168Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.6806419Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:08.6806437Z 2025-05-07T20:11:08.6831239Z 2025-05-07T20:11:08.6831887Z Dynamic section at offset 0x2201930 contains 41 entries: 2025-05-07T20:11:08.6832123Z Tag Type Name/Value 2025-05-07T20:11:08.6832513Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.6832898Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:08.6833196Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:08.6833421Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:08.6833620Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.6833839Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.6834169Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.6834382Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:08.6834629Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.6834834Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:08.6835034Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.6835222Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.6835450Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.6835718Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:11:08.6835910Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:08.6836030Z 0x000000000000000c (INIT) 0x51000 2025-05-07T20:11:08.6836148Z 0x000000000000000d (FINI) 0x14a27c 2025-05-07T20:11:08.6836273Z 0x0000000000000019 (INIT_ARRAY) 0x2201bc8 2025-05-07T20:11:08.6836449Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:11:08.6836571Z 0x000000000000001a (FINI_ARRAY) 0x2201c58 2025-05-07T20:11:08.6836691Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.6836806Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:08.6836922Z 0x000000006ffffef5 (GNU_HASH) 0x2900 2025-05-07T20:11:08.6837032Z 0x0000000000000005 (STRTAB) 0xda10 2025-05-07T20:11:08.6837139Z 0x0000000000000006 (SYMTAB) 0x4f80 2025-05-07T20:11:08.6837283Z 0x000000000000000a (STRSZ) 224745 (bytes) 2025-05-07T20:11:08.6837403Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.6837563Z 0x0000000000000003 (PLTGOT) 0x2202c00 2025-05-07T20:11:08.6837702Z 0x0000000000000002 (PLTRELSZ) 11784 (bytes) 2025-05-07T20:11:08.6837808Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.6837925Z 0x0000000000000017 (JMPREL) 0x4da10 2025-05-07T20:11:08.6838044Z 0x0000000000000007 (RELA) 0x45508 2025-05-07T20:11:08.6838173Z 0x0000000000000008 (RELASZ) 34056 (bytes) 2025-05-07T20:11:08.6838292Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.6838393Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.6838530Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.6838646Z 0x000000006ffffffe (VERNEED) 0x45388 2025-05-07T20:11:08.6838754Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:08.6838880Z 0x000000006ffffff0 (VERSYM) 0x447fa 2025-05-07T20:11:08.6838987Z 0x000000006ffffff9 (RELACOUNT) 388 2025-05-07T20:11:08.6839090Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.6839099Z 2025-05-07T20:11:08.6839225Z ################################################################################ 2025-05-07T20:11:08.6839230Z 2025-05-07T20:11:08.6839234Z 2025-05-07T20:11:08.6839346Z ################################################################################ 2025-05-07T20:11:08.6839589Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.6839709Z [CHECK] Listing out library size: 2025-05-07T20:11:08.6839944Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.6839949Z 2025-05-07T20:11:08.6847735Z 74 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.6847747Z 2025-05-07T20:11:08.6850519Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.6851021Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.7223538Z 2025-05-07T20:11:08.7223831Z GLIBC_2.2.5 2025-05-07T20:11:08.7224290Z GLIBC_2.3 2025-05-07T20:11:08.7224445Z GLIBC_2.14 2025-05-07T20:11:08.7228577Z 2025-05-07T20:11:08.7228588Z 2025-05-07T20:11:08.7229566Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.7230101Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.7230106Z 2025-05-07T20:11:08.7599412Z GLIBCXX_3.4 2025-05-07T20:11:08.7599520Z GLIBCXX_3.4.9 2025-05-07T20:11:08.7599613Z GLIBCXX_3.4.11 2025-05-07T20:11:08.7599712Z GLIBCXX_3.4.14 2025-05-07T20:11:08.7599797Z GLIBCXX_3.4.15 2025-05-07T20:11:08.7599880Z GLIBCXX_3.4.18 2025-05-07T20:11:08.7599965Z GLIBCXX_3.4.19 2025-05-07T20:11:08.7600058Z GLIBCXX_3.4.20 2025-05-07T20:11:08.7600141Z GLIBCXX_3.4.21 2025-05-07T20:11:08.7600239Z GLIBCXX_3.4.29 2025-05-07T20:11:08.7600518Z 2025-05-07T20:11:08.7600754Z 2025-05-07T20:11:08.7626320Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.1LtLXq2i1N.symbols.txt 2025-05-07T20:11:08.7626569Z 2025-05-07T20:11:08.7951131Z 2025-05-07T20:11:08.7980824Z [CHECK] Total Number of symbols: 6350 2025-05-07T20:11:08.8003750Z [CHECK] Number of fbgemm symbols: 4411 2025-05-07T20:11:08.8021363Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.e7c2Csw5Yc.usymbols.txt 2025-05-07T20:11:08.8021414Z 2025-05-07T20:11:08.8056262Z 2025-05-07T20:11:08.8079538Z [CHECK] Listing out undefined symbols (483 total): 2025-05-07T20:11:08.8094747Z U GOMP_parallel 2025-05-07T20:11:08.8095928Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.8096637Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.8097252Z U VTT for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:08.8097650Z U VTT for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:08.8097804Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:08.8097973Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:11:08.8098192Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.8098462Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:08.8098648Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.8098837Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:08.8099046Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:08.8099228Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:08.8099428Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:08.8099631Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:08.8099795Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:08.8099968Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:08.8100125Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:08.8100320Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:08.8100520Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:08.8100680Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:08.8100873Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:08.8101051Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:08.8101201Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:08.8101336Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:08.8101578Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:08.8101788Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:08.8101994Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:08.8102342Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:08.8102483Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:08.8102671Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:08.8102849Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:08.8102975Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:11:08.8103101Z U at::SplitUntil32Bit::end() const 2025-05-07T20:11:08.8103272Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:11:08.8103415Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:11:08.8103650Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:08.8103865Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:08.8104054Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:11:08.8104224Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:11:08.8104385Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:11:08.8104580Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:11:08.8104713Z U at::TensorIteratorBase::numel() const 2025-05-07T20:11:08.8104874Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:11:08.8105118Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:11:08.8105350Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:11:08.8105468Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:08.8105625Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:11:08.8105821Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:11:08.8106071Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.8106318Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.8106452Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:08.8106934Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:11:08.8107171Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.8107336Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:11:08.8107550Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:11:08.8107753Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.8107972Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:08.8108160Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:08.8108350Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:08.8108581Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:11:08.8109036Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:11:08.8109234Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.8110032Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8110827Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8111036Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:08.8111235Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:08.8111387Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:11:08.8111950Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8112131Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.8112479Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:11:08.8112700Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:08.8112832Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:11:08.8113058Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.8113183Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:08.8113364Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.8113987Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8114178Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.8114736Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8114969Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.8115289Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:08.8115490Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:08.8116040Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.8116389Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:08.8116550Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:11:08.8116774Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:08.8116920Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:11:08.8117167Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:08.8117358Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:11:08.8117610Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:08.8117926Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:08.8118544Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:08.8118731Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:11:08.8119011Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:11:08.8119155Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:08.8119384Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:08.8119542Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:08.8119653Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:08.8120136Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8120699Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8121078Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:11:08.8121219Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:11:08.8123053Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:08.8123188Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:08.8123346Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:11:08.8123679Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:11:08.8123808Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:08.8123978Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:08.8124079Z U at::get_num_threads() 2025-05-07T20:11:08.8124176Z U at::get_thread_num() 2025-05-07T20:11:08.8124332Z U at::in_parallel_region() 2025-05-07T20:11:08.8124431Z U at::init_num_threads() 2025-05-07T20:11:08.8124643Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:11:08.8124758Z U at::internal::set_thread_num(int) 2025-05-07T20:11:08.8125096Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:11:08.8126277Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8126945Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:08.8127227Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:08.8127379Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:11:08.8127530Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:08.8127700Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:08.8127825Z U bool at::Tensor::item() const 2025-05-07T20:11:08.8127979Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8128136Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8128248Z U c10::AnyType::get() 2025-05-07T20:11:08.8128435Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:08.8128618Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8128832Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8128954Z U c10::BoolType::get() 2025-05-07T20:11:08.8129063Z U c10::DeviceObjType::get() 2025-05-07T20:11:08.8129271Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:08.8129459Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:08.8129629Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:08.8130164Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:08.8130929Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:08.8131285Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.8131389Z U c10::Error::what() const 2025-05-07T20:11:08.8131506Z U c10::FloatType::get() 2025-05-07T20:11:08.8131609Z U c10::GradMode::is_enabled() 2025-05-07T20:11:08.8131719Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:08.8131919Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8132091Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8132246Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:08.8132361Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:08.8132499Z U c10::IValue::isBoolList() const 2025-05-07T20:11:08.8132603Z U c10::IValue::isIntList() const 2025-05-07T20:11:08.8132713Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:08.8132833Z U c10::IValue::isTensorList() const 2025-05-07T20:11:08.8133002Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:08.8133111Z U c10::InferenceMode::is_enabled() 2025-05-07T20:11:08.8133223Z U c10::IntType::get() 2025-05-07T20:11:08.8133692Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.8133859Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:08.8133996Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:08.8134127Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.8134254Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:08.8134484Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.8134610Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:08.8134729Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:08.8134848Z U c10::ScalarTypeType::get() 2025-05-07T20:11:08.8135120Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:08.8135428Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:11:08.8135602Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:08.8135706Z U c10::StringType::get() 2025-05-07T20:11:08.8135844Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:08.8136000Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:08.8136149Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:08.8136546Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:08.8136693Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:08.8136857Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:11:08.8136991Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:08.8137165Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:08.8137285Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:08.8137412Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:08.8137540Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:08.8137844Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:08.8137952Z U c10::SymIntType::get() 2025-05-07T20:11:08.8138169Z U c10::SymbolicShapeMeta::init_is_channels_last_3d_contiguous() const 2025-05-07T20:11:08.8138396Z U c10::SymbolicShapeMeta::init_is_channels_last_contiguous() const 2025-05-07T20:11:08.8138555Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:08.8138678Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:08.8139184Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:08.8139343Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:08.8139501Z U c10::TensorImpl::throw_storage_access_error() const 2025-05-07T20:11:08.8139639Z U c10::TensorType::get() 2025-05-07T20:11:08.8141150Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:08.8141355Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:08.8141542Z U c10::Type::is_module() const 2025-05-07T20:11:08.8141673Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:08.8142738Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:08.8143007Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:08.8143197Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:08.8143491Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:08.8143843Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:08.8143965Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:08.8144106Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:08.8144224Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:08.8144352Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:08.8144473Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:08.8144745Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:08.8144858Z U c10::cuda::current_device() 2025-05-07T20:11:08.8144963Z U c10::cuda::device_count() 2025-05-07T20:11:08.8145120Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:08.8145262Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:08.8145409Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:08.8145568Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:08.8145735Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:08.8145888Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:08.8146381Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:08.8146927Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:08.8147190Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:08.8147721Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.8148078Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:08.8148708Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:08.8149136Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:08.8149353Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:08.8149499Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:08.8149613Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:08.8150004Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:08.8150217Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:08.8150351Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:11:08.8150524Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:08.8150702Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:08.8150876Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:08.8151006Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:08.8151130Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.8151302Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:08.8151694Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:08.8151823Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:11:08.8151961Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:08.8152111Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:11:08.8152240Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:08.8152404Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:11:08.8152530Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:11:08.8152652Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:08.8152822Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:08.8152960Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:08.8153128Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:08.8153291Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:08.8153422Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:11:08.8153547Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:08.8153693Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:08.8153812Z U c10::report_overflow(char const*) 2025-05-07T20:11:08.8153937Z U c10::throwNullDataPtrError() 2025-05-07T20:11:08.8154116Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:08.8154230Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:08.8154357Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:08.8154591Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:08.8154734Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:08.8154854Z U cublasGemmStridedBatchedEx 2025-05-07T20:11:08.8154958Z U cublasSetStream_v2 2025-05-07T20:11:08.8155110Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:08.8155246Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:11:08.8155377Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:08.8155636Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:08.8155752Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:08.8155872Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:08.8155982Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:08.8156131Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:08.8156247Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:08.8156347Z U cudaFree@libcudart.so.12 2025-05-07T20:11:08.8156484Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:08.8156601Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:08.8156706Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:08.8156836Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:08.8156966Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:08.8157085Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:08.8157223Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:08.8157363Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:11:08.8157471Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:11:08.8157590Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:11:08.8157718Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:08.8157831Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:11:08.8157937Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:11:08.8158067Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:11:08.8158176Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:11:08.8158289Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:08.8158398Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:08.8158681Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:08.8158798Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:08.8158907Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:08.8159033Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:08.8159148Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:08.8159265Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:08.8159416Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8159575Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8159669Z U exit@GLIBC_2.2.5 2025-05-07T20:11:08.8159759Z U exp10@GLIBC_2.2.5 2025-05-07T20:11:08.8159861Z U exp@GLIBC_2.2.5 2025-05-07T20:11:08.8159950Z U expf@GLIBC_2.2.5 2025-05-07T20:11:08.8160140Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:08.8160340Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:08.8160558Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:08.8160746Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:08.8160979Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:08.8161110Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8161262Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8161364Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:08.8161448Z U fmod@GLIBC_2.2.5 2025-05-07T20:11:08.8161533Z U free@GLIBC_2.2.5 2025-05-07T20:11:08.8161647Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:11:08.8161766Z U int at::Tensor::item() const 2025-05-07T20:11:08.8161927Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:08.8162051Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8162208Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8162299Z U lgamma@GLIBC_2.2.5 2025-05-07T20:11:08.8162415Z U llrint@GLIBC_2.2.5 2025-05-07T20:11:08.8162518Z U log10@GLIBC_2.2.5 2025-05-07T20:11:08.8162605Z U log2@GLIBC_2.2.5 2025-05-07T20:11:08.8162686Z U log@GLIBC_2.2.5 2025-05-07T20:11:08.8162790Z U long at::Tensor::item() const 2025-05-07T20:11:08.8162963Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:08.8163117Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:08.8163249Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8163400Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8163521Z U lrint@GLIBC_2.2.5 2025-05-07T20:11:08.8163623Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:08.8163727Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:08.8163813Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:08.8163903Z U memcpy@GLIBC_2.14 2025-05-07T20:11:08.8164006Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:08.8164096Z U memset@GLIBC_2.2.5 2025-05-07T20:11:08.8164194Z U nvmlDeviceGetCount_v2 2025-05-07T20:11:08.8164307Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:11:08.8164443Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:11:08.8164545Z U nvmlDeviceGetNvLinkState 2025-05-07T20:11:08.8164645Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:11:08.8164748Z U nvmlInit_v2 2025-05-07T20:11:08.8164836Z U omp_get_num_threads 2025-05-07T20:11:08.8164929Z U omp_get_thread_num 2025-05-07T20:11:08.8165075Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:08.8165210Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.8165331Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:08.8165425Z U pow@GLIBC_2.2.5 2025-05-07T20:11:08.8165530Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:08.8165681Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8165871Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8165975Z U sin@GLIBC_2.2.5 2025-05-07T20:11:08.8166178Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:08.8166351Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:08.8166548Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:11:08.8166744Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:08.8167119Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:08.8167314Z U std::__basic_file::~__basic_file()@GLIBCXX_3.4 2025-05-07T20:11:08.8167646Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:08.8168026Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.8168375Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:08.8168751Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:08.8169120Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:08.8169322Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:08.8169460Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:08.8169574Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:08.8169718Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:08.8169837Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:08.8169946Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:08.8170076Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:08.8170213Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:08.8170358Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.8170542Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.8170682Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.8170848Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:08.8170989Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:08.8171120Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:08.8171345Z U std::basic_filebuf >::basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:08.8171563Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:11:08.8171852Z U std::basic_filebuf >::open(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:11:08.8172073Z U std::basic_filebuf >::~basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:08.8172322Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:11:08.8172550Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:08.8172885Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:08.8173128Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:11:08.8173685Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.8174184Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:08.8174353Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:11:08.8174477Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:08.8174627Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:08.8174783Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:08.8174905Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:08.8175020Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:08.8175148Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:08.8175261Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.8175376Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:08.8175492Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:08.8175597Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:08.8175793Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:08.8175982Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.8176208Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:08.8176341Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:08.8176473Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:08.8176582Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:08.8176720Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:11:08.8176894Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:08.8177022Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:08.8177235Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:08.8177689Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:08.8177827Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:08.8177933Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:08.8178042Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:08.8178132Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:08.8178222Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:08.8178341Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:08.8178927Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:08.8179371Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.8179865Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:08.8180116Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:08.8180238Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:08.8180534Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:08.8180710Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:08.8180907Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:08.8181099Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:08.8181433Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:08.8181580Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:08.8181806Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:08.8182003Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:08.8182120Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:08.8182432Z U torch::autograd::Node::metadata() 2025-05-07T20:11:08.8182568Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:08.8182812Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:08.8183097Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:08.8183235Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:08.8183447Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:08.8183687Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:08.8188043Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:08.8188358Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:08.8188514Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:08.8188687Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:08.8188862Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:08.8189396Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:08.8189777Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.8190198Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:08.8190406Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:11:08.8190554Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:11:08.8191133Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:08.8191244Z U typeinfo for c10::Error 2025-05-07T20:11:08.8191372Z U typeinfo for c10::Type 2025-05-07T20:11:08.8191518Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.8191650Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:08.8191783Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:08.8191930Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:08.8192055Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:08.8192250Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:08.8192526Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:08.8192993Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:08.8193585Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:08.8194053Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:08.8194599Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:08.8195073Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:11:08.8195613Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:11:08.8196107Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:08.8196724Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:08.8197328Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:08.8198323Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:08.8199399Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:08.8199772Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:08.8200077Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:08.8200243Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:08.8200401Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.8200576Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:08.8200693Z U vtable for at::TensorIterator 2025-05-07T20:11:08.8200815Z U vtable for at::TensorIteratorBase 2025-05-07T20:11:08.8200931Z U vtable for c10::Error 2025-05-07T20:11:08.8201037Z U vtable for c10::ListType 2025-05-07T20:11:08.8201403Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.8201767Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.8202125Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:08.8202261Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:08.8202501Z U vtable for std::basic_filebuf >@GLIBCXX_3.4 2025-05-07T20:11:08.8202727Z U vtable for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:08.8202932Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:08.8203168Z U vtable for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:08.8203399Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:08.8203567Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:08.8203699Z U vtable for torch::autograd::Node 2025-05-07T20:11:08.8203880Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:08.8204047Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:08.8204158Z w _ITM_registerTMCloneTable 2025-05-07T20:11:08.8204277Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:08.8204392Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:08.8204483Z w __gmon_start__ 2025-05-07T20:11:08.8204592Z w __pthread_key_create 2025-05-07T20:11:08.8204705Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:08.8204821Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:08.8204926Z w pthread_once 2025-05-07T20:11:08.8205078Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:08.8205258Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.8205265Z 2025-05-07T20:11:08.8205428Z linux-vdso.so.1 (0x00007ffcab4a8000) 2025-05-07T20:11:08.8205543Z libc10.so => not found 2025-05-07T20:11:08.8205647Z libc10_cuda.so => not found 2025-05-07T20:11:08.8206035Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f48de000000) 2025-05-07T20:11:08.8206138Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:08.8206237Z libtorch.so => not found 2025-05-07T20:11:08.8206828Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f48dde50000) 2025-05-07T20:11:08.8207306Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f48dcc00000) 2025-05-07T20:11:08.8207437Z libtorch_cpu.so => not found 2025-05-07T20:11:08.8207535Z libtorch_cuda.so => not found 2025-05-07T20:11:08.8207642Z libcudart.so.12 => not found 2025-05-07T20:11:08.8207809Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f48dc99c000) 2025-05-07T20:11:08.8207936Z libm.so.6 => /lib64/libm.so.6 (0x00007f48e346a000) 2025-05-07T20:11:08.8208108Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f48e343c000) 2025-05-07T20:11:08.8208235Z libc.so.6 => /lib64/libc.so.6 (0x00007f48dc794000) 2025-05-07T20:11:08.8208366Z /lib64/ld-linux-x86-64.so.2 (0x00007f48e354d000) 2025-05-07T20:11:08.8208469Z libc10.so => not found 2025-05-07T20:11:08.8208837Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f48de585000) 2025-05-07T20:11:08.8208930Z libtorch.so => not found 2025-05-07T20:11:08.8209028Z libtorch_cpu.so => not found 2025-05-07T20:11:08.8209134Z libtorch_cuda.so => not found 2025-05-07T20:11:08.8209219Z libc10.so => not found 2025-05-07T20:11:08.8209315Z libc10_cuda.so => not found 2025-05-07T20:11:08.8209423Z libtorch.so => not found 2025-05-07T20:11:08.8209522Z libtorch_cpu.so => not found 2025-05-07T20:11:08.8209622Z libtorch_cuda.so => not found 2025-05-07T20:11:08.8209722Z libcudart.so.12 => not found 2025-05-07T20:11:08.8209827Z libtorch.so => not found 2025-05-07T20:11:08.8209916Z libc10.so => not found 2025-05-07T20:11:08.8210011Z libc10_cuda.so => not found 2025-05-07T20:11:08.8210118Z libtorch_cpu.so => not found 2025-05-07T20:11:08.8210212Z libtorch_cuda.so => not found 2025-05-07T20:11:08.8210307Z libcudart.so.12 => not found 2025-05-07T20:11:08.8210400Z libtorch_cpu.so => not found 2025-05-07T20:11:08.8210508Z libtorch_cuda.so => not found 2025-05-07T20:11:08.8210600Z libtorch.so => not found 2025-05-07T20:11:08.8210605Z 2025-05-07T20:11:08.8210712Z [CHECK] Displaying ELF information: 2025-05-07T20:11:08.8210934Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:08.8210939Z 2025-05-07T20:11:08.8210946Z 2025-05-07T20:11:08.8211111Z Dynamic section at offset 0x4953578 contains 43 entries: 2025-05-07T20:11:08.8211372Z Tag Type Name/Value 2025-05-07T20:11:08.8211690Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:08.8211907Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:08.8212089Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:08.8212311Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:08.8212498Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:08.8212735Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:08.8212948Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:08.8213155Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:08.8213354Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:08.8213551Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:08.8213758Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:08.8213972Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:08.8214159Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:08.8214356Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:08.8214559Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:08.8214754Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:11:08.8214943Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:08.8215058Z 0x000000000000000c (INIT) 0x18e000 2025-05-07T20:11:08.8215190Z 0x000000000000000d (FINI) 0x7e464c 2025-05-07T20:11:08.8215303Z 0x0000000000000019 (INIT_ARRAY) 0x494d470 2025-05-07T20:11:08.8215447Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:11:08.8215559Z 0x000000000000001a (FINI_ARRAY) 0x494d8f8 2025-05-07T20:11:08.8215679Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:08.8215794Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:08.8215906Z 0x000000006ffffef5 (GNU_HASH) 0x8530 2025-05-07T20:11:08.8216014Z 0x0000000000000005 (STRTAB) 0x363a0 2025-05-07T20:11:08.8216130Z 0x0000000000000006 (SYMTAB) 0x11038 2025-05-07T20:11:08.8216267Z 0x000000000000000a (STRSZ) 1209140 (bytes) 2025-05-07T20:11:08.8216386Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:08.8216497Z 0x0000000000000003 (PLTGOT) 0x4954868 2025-05-07T20:11:08.8216649Z 0x0000000000000002 (PLTRELSZ) 42168 (bytes) 2025-05-07T20:11:08.8216753Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:08.8216860Z 0x0000000000000017 (JMPREL) 0x183378 2025-05-07T20:11:08.8216987Z 0x0000000000000007 (RELA) 0x160a28 2025-05-07T20:11:08.8217111Z 0x0000000000000008 (RELASZ) 141648 (bytes) 2025-05-07T20:11:08.8217226Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:08.8217323Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:08.8217459Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:08.8217575Z 0x000000006ffffffe (VERNEED) 0x160878 2025-05-07T20:11:08.8217682Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:08.8217804Z 0x000000006ffffff0 (VERSYM) 0x15d6d4 2025-05-07T20:11:08.8217904Z 0x000000006ffffff9 (RELACOUNT) 516 2025-05-07T20:11:08.8217999Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:08.8218003Z 2025-05-07T20:11:08.8218130Z ################################################################################ 2025-05-07T20:11:08.8218136Z 2025-05-07T20:11:08.8218139Z 2025-05-07T20:11:08.8218270Z ################################################################################ 2025-05-07T20:11:08.8218567Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:08.8218718Z [CHECK] Listing out library size: 2025-05-07T20:11:08.8219003Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:08.8219007Z 2025-05-07T20:11:08.8219229Z 908 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:08.8219234Z 2025-05-07T20:11:08.8219658Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:08.8220156Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:08.8220162Z 2025-05-07T20:11:09.0051336Z GLIBC_2.2.5 2025-05-07T20:11:09.0052465Z GLIBC_2.3 2025-05-07T20:11:09.0053163Z GLIBC_2.14 2025-05-07T20:11:09.0053296Z 2025-05-07T20:11:09.0053300Z 2025-05-07T20:11:09.0053778Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:09.0055117Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.0055797Z 2025-05-07T20:11:09.1898234Z GLIBCXX_3.4 2025-05-07T20:11:09.1899374Z GLIBCXX_3.4.9 2025-05-07T20:11:09.1900452Z GLIBCXX_3.4.11 2025-05-07T20:11:09.1901053Z GLIBCXX_3.4.14 2025-05-07T20:11:09.1901641Z GLIBCXX_3.4.15 2025-05-07T20:11:09.1902209Z GLIBCXX_3.4.18 2025-05-07T20:11:09.1902894Z GLIBCXX_3.4.20 2025-05-07T20:11:09.1903085Z GLIBCXX_3.4.21 2025-05-07T20:11:09.1903288Z GLIBCXX_3.4.29 2025-05-07T20:11:09.1904659Z 2025-05-07T20:11:09.1904664Z 2025-05-07T20:11:09.1916741Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.dnd7yUyylq.symbols.txt 2025-05-07T20:11:09.1918390Z 2025-05-07T20:11:09.3736536Z 2025-05-07T20:11:09.3801461Z [CHECK] Total Number of symbols: 12349 2025-05-07T20:11:09.3882620Z [CHECK] Number of fbgemm symbols: 2031 2025-05-07T20:11:09.3903047Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.NfCUniFwgK.usymbols.txt 2025-05-07T20:11:09.3904154Z 2025-05-07T20:11:09.3961553Z 2025-05-07T20:11:09.3987331Z [CHECK] Listing out undefined symbols (289 total): 2025-05-07T20:11:09.4011019Z U GOMP_parallel 2025-05-07T20:11:09.4012090Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.4013491Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.4014065Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:09.4014439Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.4014838Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.4015252Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.4015636Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:09.4016027Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:09.4016403Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:09.4016768Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.4017154Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:09.4017484Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:09.4017925Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:09.4018240Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:09.4018689Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:09.4019175Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:09.4019484Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:09.4019809Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:09.4020162Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:09.4020475Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:09.4020774Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:09.4021083Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:09.4021379Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:09.4021712Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:09.4022113Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:09.4022514Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:09.4022929Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:09.4023321Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:09.4023679Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:09.4024090Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:09.4024532Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:09.4025149Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:09.4025716Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:09.4026564Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.4027872Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.4028849Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:09.4030241Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.4031473Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:09.4032060Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:09.4032518Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:09.4033331Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.4034558Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.4035470Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:09.4036015Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:09.4036372Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:09.4036767Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:09.4037124Z U at::get_num_threads() 2025-05-07T20:11:09.4037430Z U at::get_thread_num() 2025-05-07T20:11:09.4037720Z U at::globalContext() 2025-05-07T20:11:09.4038025Z U at::in_parallel_region() 2025-05-07T20:11:09.4051070Z U at::init_num_threads() 2025-05-07T20:11:09.4051565Z U at::internal::set_thread_num(int) 2025-05-07T20:11:09.4051915Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:09.4052333Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:09.4052803Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:09.4053142Z U c10::AnyType::get() 2025-05-07T20:11:09.4053526Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.4053944Z U c10::BoolType::get() 2025-05-07T20:11:09.4054288Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:09.4054732Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:09.4055125Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:09.4055865Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:09.4057084Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:09.4058201Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:09.4058777Z U c10::Error::what() const 2025-05-07T20:11:09.4059086Z U c10::FloatType::get() 2025-05-07T20:11:09.4059390Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:09.4059722Z U c10::GradMode::is_enabled() 2025-05-07T20:11:09.4060029Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:09.4060401Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.4061062Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.4061516Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:09.4061912Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:09.4062251Z U c10::IValue::isBoolList() const 2025-05-07T20:11:09.4062693Z U c10::IValue::isIntList() const 2025-05-07T20:11:09.4063226Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:09.4063580Z U c10::IValue::isTensorList() const 2025-05-07T20:11:09.4064170Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:09.4064747Z U c10::IntType::get() 2025-05-07T20:11:09.4065259Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:09.4066043Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:09.4066468Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:09.4066830Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:09.4067311Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:09.4067809Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:09.4068181Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:09.4068726Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:09.4069420Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.4069834Z U c10::StringType::get() 2025-05-07T20:11:09.4070204Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:09.4070619Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:09.4071324Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:09.4072059Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:09.4072458Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:09.4072819Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:09.4073177Z U c10::SymIntType::get() 2025-05-07T20:11:09.4073561Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:09.4073957Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:09.4074368Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.4074746Z U c10::TensorType::get() 2025-05-07T20:11:09.4075105Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:09.4076119Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:09.4077135Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:09.4077529Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:09.4077940Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:09.4078298Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:09.4078664Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:09.4079014Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:09.4079516Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:09.4080001Z U c10::cuda::device_count() 2025-05-07T20:11:09.4080380Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:09.4080794Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:09.4081233Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:09.4081776Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:09.4082166Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:09.4082557Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:09.4083252Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:09.4084280Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:09.4085514Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:09.4086425Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.4087436Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:09.4088515Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.4089379Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:09.4089742Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:09.4090301Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:09.4090959Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:09.4091422Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:09.4091872Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:09.4092299Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:09.4092739Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:09.4093142Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:09.4093842Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:09.4094476Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:09.4094863Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:09.4095260Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:09.4095692Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:09.4096119Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:09.4096524Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:11:09.4096907Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:09.4097266Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:09.4097736Z U c10::throwNullDataPtrError() 2025-05-07T20:11:09.4098085Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:09.4098417Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:09.4098813Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:09.4099239Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:09.4099593Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:09.4099940Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:09.4100302Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:09.4100641Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:09.4100986Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:09.4101353Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:09.4101691Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:09.4102043Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:09.4102380Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:09.4102740Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:09.4103093Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:09.4103432Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:09.4103759Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:09.4104099Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:09.4104437Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:09.4104793Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:09.4105770Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:09.4106940Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:11:09.4107497Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:09.4107909Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:09.4108316Z U float at::Tensor::item() const 2025-05-07T20:11:09.4108672Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.4109160Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.4109721Z U free@GLIBC_2.2.5 2025-05-07T20:11:09.4110131Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.4110527Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.4110989Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:09.4111462Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.4111868Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.4112267Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:09.4112568Z U memcpy@GLIBC_2.14 2025-05-07T20:11:09.4112855Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:09.4113156Z U memset@GLIBC_2.2.5 2025-05-07T20:11:09.4113456Z U omp_get_num_threads 2025-05-07T20:11:09.4113748Z U omp_get_thread_num 2025-05-07T20:11:09.4114099Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:09.4114489Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:09.4115076Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.4115849Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.4116623Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.4117465Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.4118247Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.4119037Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.4119597Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:09.4120268Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:11:09.4121325Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:11:09.4122068Z U sqrt@GLIBC_2.2.5 2025-05-07T20:11:09.4122346Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:09.4122740Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:09.4123377Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:09.4124191Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:09.4125002Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:09.4125795Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:09.4126433Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:09.4126816Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:09.4127178Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:09.4127688Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:09.4128044Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:09.4128445Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.4128828Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.4129262Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:09.4129686Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:09.4130423Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:09.4131377Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:09.4132132Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:09.4133394Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.4134637Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.4135412Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:09.4135771Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:09.4136135Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:09.4136489Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:09.4136834Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.4137206Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.4137534Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:09.4137876Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:09.4138311Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.4138863Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.4139362Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:09.4139767Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:09.4140189Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:09.4140691Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:09.4141475Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:09.4142168Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:09.4142523Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:09.4143056Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:09.4143499Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:09.4143826Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:09.4144843Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:09.4146646Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.4147634Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.4148437Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:09.4149075Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:09.4149697Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:09.4150206Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:09.4150732Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:09.4151411Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:09.4152041Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:09.4152527Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:09.4153068Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:09.4153508Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:09.4153915Z U torch::autograd::Node::metadata() 2025-05-07T20:11:09.4154273Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:09.4154789Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:09.4155439Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:09.4155986Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:09.4156473Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:09.4157028Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:09.4160217Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:09.4163408Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:09.4163863Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:09.4164291Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:09.4164741Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:09.4165438Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:09.4166327Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:09.4167367Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:09.4168153Z U typeinfo for c10::Error 2025-05-07T20:11:09.4168493Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:09.4168880Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:09.4169241Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:09.4169612Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:09.4169971Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:09.4171278Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:09.4173553Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:09.4174933Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:09.4175356Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:09.4175821Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:09.4176355Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:09.4176725Z U vtable for c10::Error 2025-05-07T20:11:09.4177258Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.4178015Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.4178768Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.4179340Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:09.4179755Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:09.4180276Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:09.4180734Z U vtable for torch::autograd::Node 2025-05-07T20:11:09.4181124Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:09.4181510Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:09.4181811Z w _ITM_registerTMCloneTable 2025-05-07T20:11:09.4182116Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:09.4182401Z w __gmon_start__ 2025-05-07T20:11:09.4182671Z w __pthread_key_create 2025-05-07T20:11:09.4182956Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:09.4183279Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:09.4183657Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:09.4184135Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:09.4184938Z 2025-05-07T20:11:09.4185246Z linux-vdso.so.1 (0x00007ffce07bd000) 2025-05-07T20:11:09.4185593Z libc10.so => not found 2025-05-07T20:11:09.4185850Z libc10_cuda.so => not found 2025-05-07T20:11:09.4186512Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f0148000000) 2025-05-07T20:11:09.4187680Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f0147e50000) 2025-05-07T20:11:09.4188464Z libtorch.so => not found 2025-05-07T20:11:09.4189087Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f0147800000) 2025-05-07T20:11:09.4190046Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f0146600000) 2025-05-07T20:11:09.4190732Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4191020Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4191295Z libcudart.so.12 => not found 2025-05-07T20:11:09.4191647Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f014639c000) 2025-05-07T20:11:09.4192045Z libm.so.6 => /lib64/libm.so.6 (0x00007f0147d75000) 2025-05-07T20:11:09.4192446Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f018259f000) 2025-05-07T20:11:09.4192843Z libc.so.6 => /lib64/libc.so.6 (0x00007f0146194000) 2025-05-07T20:11:09.4193206Z /lib64/ld-linux-x86-64.so.2 (0x00007f01825d5000) 2025-05-07T20:11:09.4193551Z libc10.so => not found 2025-05-07T20:11:09.4193798Z libc10_cuda.so => not found 2025-05-07T20:11:09.4194442Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f0182593000) 2025-05-07T20:11:09.4195106Z libtorch.so => not found 2025-05-07T20:11:09.4195379Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4195761Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4196040Z libcudart.so.12 => not found 2025-05-07T20:11:09.4196315Z libc10.so => not found 2025-05-07T20:11:09.4196568Z libc10_cuda.so => not found 2025-05-07T20:11:09.4196916Z libtorch.so => not found 2025-05-07T20:11:09.4197171Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4197468Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4197740Z libcudart.so.12 => not found 2025-05-07T20:11:09.4198019Z libc10.so => not found 2025-05-07T20:11:09.4198541Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f0148385000) 2025-05-07T20:11:09.4199135Z libtorch.so => not found 2025-05-07T20:11:09.4199414Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4199689Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4199983Z libtorch.so => not found 2025-05-07T20:11:09.4200239Z libc10.so => not found 2025-05-07T20:11:09.4200498Z libc10_cuda.so => not found 2025-05-07T20:11:09.4200769Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4201056Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4201327Z libcudart.so.12 => not found 2025-05-07T20:11:09.4201744Z libc10.so => not found 2025-05-07T20:11:09.4201980Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4202249Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4202509Z libtorch.so => not found 2025-05-07T20:11:09.4202748Z libtorch_cpu.so => not found 2025-05-07T20:11:09.4203016Z libtorch_cuda.so => not found 2025-05-07T20:11:09.4203264Z libtorch.so => not found 2025-05-07T20:11:09.4203420Z 2025-05-07T20:11:09.4203540Z [CHECK] Displaying ELF information: 2025-05-07T20:11:09.4203986Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:09.4204361Z 2025-05-07T20:11:09.4204400Z 2025-05-07T20:11:09.4204555Z Dynamic section at offset 0x38b44998 contains 43 entries: 2025-05-07T20:11:09.4204971Z Tag Type Name/Value 2025-05-07T20:11:09.4205361Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:09.4205848Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:09.4206354Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:09.4206916Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:09.4207451Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:09.4207921Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:09.4208424Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:09.4208931Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:09.4209436Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:09.4209937Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:09.4210447Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:09.4210942Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:09.4211418Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:09.4211900Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:09.4212389Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:09.4212962Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:09.4213504Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:09.4213886Z 0x000000000000000c (INIT) 0x611000 2025-05-07T20:11:09.4214223Z 0x000000000000000d (FINI) 0x32390cc 2025-05-07T20:11:09.4214543Z 0x0000000000000019 (INIT_ARRAY) 0x38b425f8 2025-05-07T20:11:09.4214922Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:11:09.4215259Z 0x000000000000001a (FINI_ARRAY) 0x38b42d18 2025-05-07T20:11:09.4215591Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:09.4215931Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:09.4216252Z 0x000000006ffffef5 (GNU_HASH) 0x10330 2025-05-07T20:11:09.4216573Z 0x0000000000000005 (STRTAB) 0x69580 2025-05-07T20:11:09.4216877Z 0x0000000000000006 (SYMTAB) 0x20fb0 2025-05-07T20:11:09.4217223Z 0x000000000000000a (STRSZ) 4919620 (bytes) 2025-05-07T20:11:09.4217561Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:09.4217896Z 0x0000000000000003 (PLTGOT) 0x38b44c88 2025-05-07T20:11:09.4218239Z 0x0000000000000002 (PLTRELSZ) 50064 (bytes) 2025-05-07T20:11:09.4218576Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:09.4218878Z 0x0000000000000017 (JMPREL) 0x603da0 2025-05-07T20:11:09.4219201Z 0x0000000000000007 (RELA) 0x5208e0 2025-05-07T20:11:09.4219542Z 0x0000000000000008 (RELASZ) 931008 (bytes) 2025-05-07T20:11:09.4219908Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:09.4220229Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:09.4220530Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:09.4220869Z 0x000000006ffffffe (VERNEED) 0x520740 2025-05-07T20:11:09.4221177Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:09.4221487Z 0x000000006ffffff0 (VERSYM) 0x51a6c4 2025-05-07T20:11:09.4221812Z 0x000000006ffffff9 (RELACOUNT) 26208 2025-05-07T20:11:09.4222102Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:09.4222290Z 2025-05-07T20:11:09.4222407Z ################################################################################ 2025-05-07T20:11:09.4222649Z 2025-05-07T20:11:09.4222652Z 2025-05-07T20:11:09.4222757Z ################################################################################ 2025-05-07T20:11:09.4223281Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.4223797Z [CHECK] Listing out library size: 2025-05-07T20:11:09.4224268Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.4224664Z 2025-05-07T20:11:09.4224907Z 142 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.4225249Z 2025-05-07T20:11:09.4225665Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.4226700Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.4227310Z 2025-05-07T20:11:09.4422921Z GLIBC_2.2.5 2025-05-07T20:11:09.4423360Z GLIBC_2.3 2025-05-07T20:11:09.4423706Z GLIBC_2.14 2025-05-07T20:11:09.4423885Z 2025-05-07T20:11:09.4423889Z 2025-05-07T20:11:09.4424376Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.4425659Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.4426325Z 2025-05-07T20:11:09.4698267Z GLIBCXX_3.4 2025-05-07T20:11:09.4698973Z GLIBCXX_3.4.9 2025-05-07T20:11:09.4699585Z GLIBCXX_3.4.11 2025-05-07T20:11:09.4700200Z GLIBCXX_3.4.18 2025-05-07T20:11:09.4700768Z GLIBCXX_3.4.20 2025-05-07T20:11:09.4701364Z GLIBCXX_3.4.21 2025-05-07T20:11:09.4701967Z GLIBCXX_3.4.29 2025-05-07T20:11:09.4702335Z 2025-05-07T20:11:09.4702349Z 2025-05-07T20:11:09.4719624Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.k6SSdCNu1e.symbols.txt 2025-05-07T20:11:09.4721299Z 2025-05-07T20:11:09.4958204Z 2025-05-07T20:11:09.4985460Z [CHECK] Total Number of symbols: 1624 2025-05-07T20:11:09.5007922Z [CHECK] Number of fbgemm symbols: 228 2025-05-07T20:11:09.5025114Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.YBhnrRhb2f.usymbols.txt 2025-05-07T20:11:09.5025740Z 2025-05-07T20:11:09.5047505Z 2025-05-07T20:11:09.5072204Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:09.5087091Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5089651Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5091273Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:09.5092305Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.5093473Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.5094615Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.5095696Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:09.5096486Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:09.5096858Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:09.5097226Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.5097587Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:09.5097901Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:09.5098241Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:09.5098544Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:09.5098866Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:09.5099206Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:09.5099613Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:09.5099950Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:09.5100377Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:09.5100925Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:09.5101369Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:09.5101936Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:09.5102766Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5104057Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5105377Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:09.5105988Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:09.5106911Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5108099Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5109056Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:09.5109665Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:09.5110029Z U at::globalContext() 2025-05-07T20:11:09.5110507Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5111013Z U c10::BoolType::get() 2025-05-07T20:11:09.5111375Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:09.5111801Z U c10::FloatType::get() 2025-05-07T20:11:09.5112118Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:09.5112531Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5112975Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:09.5113328Z U c10::IntType::get() 2025-05-07T20:11:09.5113690Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:09.5114090Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:09.5114487Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.5114907Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:09.5115307Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:09.5116005Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:09.5116711Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:09.5117086Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:09.5117409Z U c10::SymIntType::get() 2025-05-07T20:11:09.5117777Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:09.5118216Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.5118582Z U c10::TensorType::get() 2025-05-07T20:11:09.5118915Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:09.5119898Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:09.5120934Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:09.5121310Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:09.5121662Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:09.5122108Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:09.5122425Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:09.5122748Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:09.5123196Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:09.5123636Z U c10::cuda::device_count() 2025-05-07T20:11:09.5123959Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:09.5124310Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:09.5124680Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:09.5125052Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:09.5125432Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:09.5125799Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:09.5126496Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:09.5127345Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:09.5128176Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.5129078Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:09.5130102Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.5130918Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:09.5131222Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:09.5131559Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:09.5131954Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:09.5132328Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:09.5132672Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:09.5133025Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:09.5133365Z U c10::throwNullDataPtrError() 2025-05-07T20:11:09.5133666Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:09.5133981Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:09.5134548Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:09.5135079Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:09.5135440Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:09.5135801Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:09.5136178Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:09.5136539Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:09.5137070Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:09.5137489Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:09.5137842Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:09.5138241Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:09.5138595Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:09.5138977Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:09.5139343Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:09.5139705Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:09.5140043Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:09.5140393Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:09.5140742Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:09.5141111Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:09.5143657Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:09.5146258Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:09.5146710Z U float at::Tensor::item() const 2025-05-07T20:11:09.5147077Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.5147499Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5147895Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.5148287Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5148715Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:09.5149265Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.5149706Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5150068Z U memcpy@GLIBC_2.14 2025-05-07T20:11:09.5150371Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:09.5150687Z U memset@GLIBC_2.2.5 2025-05-07T20:11:09.5151038Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:09.5151425Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:09.5152012Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.5152791Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.5153564Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.5154363Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.5155172Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:09.5156119Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:09.5157000Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:09.5157862Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:09.5158479Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:09.5158838Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:09.5159236Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.5159654Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.5160087Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:09.5160530Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:09.5161049Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:09.5161884Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:09.5162912Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.5164090Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.5164816Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:09.5165178Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:09.5165519Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.5165860Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.5166205Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:09.5166521Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:09.5166929Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.5167442Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.5167919Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:09.5168262Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:09.5168558Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:09.5168900Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:09.5169721Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:09.5170858Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.5171672Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.5172380Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:09.5173385Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:09.5175718Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5179059Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5182108Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5185478Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5188431Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5191468Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5195141Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.5199429Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.5203783Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.5207624Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.5211400Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.5215195Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.5218807Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:09.5220716Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:09.5221137Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:09.5221561Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:09.5222160Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5222938Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5223724Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5224362Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:09.5224945Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:09.5225371Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:09.5225696Z w _ITM_registerTMCloneTable 2025-05-07T20:11:09.5225994Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:09.5226289Z w __gmon_start__ 2025-05-07T20:11:09.5226567Z w __pthread_key_create 2025-05-07T20:11:09.5226862Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:09.5227197Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:09.5227543Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:09.5228036Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.5228389Z 2025-05-07T20:11:09.5228553Z linux-vdso.so.1 (0x00007fff43f2c000) 2025-05-07T20:11:09.5228827Z libc10.so => not found 2025-05-07T20:11:09.5229176Z libc10_cuda.so => not found 2025-05-07T20:11:09.5230103Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f7d94e00000) 2025-05-07T20:11:09.5230914Z libtorch.so => not found 2025-05-07T20:11:09.5231175Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5231467Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5231755Z libcudart.so.12 => not found 2025-05-07T20:11:09.5232093Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f7d94b9c000) 2025-05-07T20:11:09.5232536Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7dd81f8000) 2025-05-07T20:11:09.5232928Z libc.so.6 => /lib64/libc.so.6 (0x00007f7d94994000) 2025-05-07T20:11:09.5233352Z /lib64/ld-linux-x86-64.so.2 (0x00007f7dd822c000) 2025-05-07T20:11:09.5233682Z libc10.so => not found 2025-05-07T20:11:09.5233942Z libc10_cuda.so => not found 2025-05-07T20:11:09.5234607Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f7d94600000) 2025-05-07T20:11:09.5235784Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f7d94450000) 2025-05-07T20:11:09.5236570Z libtorch.so => not found 2025-05-07T20:11:09.5237113Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f7d93e00000) 2025-05-07T20:11:09.5238094Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f7d92c00000) 2025-05-07T20:11:09.5238799Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5239075Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5239363Z libcudart.so.12 => not found 2025-05-07T20:11:09.5239665Z libm.so.6 => /lib64/libm.so.6 (0x00007f7dd8119000) 2025-05-07T20:11:09.5240007Z libc10.so => not found 2025-05-07T20:11:09.5240256Z libc10_cuda.so => not found 2025-05-07T20:11:09.5240906Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f7dd810b000) 2025-05-07T20:11:09.5241570Z libtorch.so => not found 2025-05-07T20:11:09.5241954Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5242227Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5242472Z libcudart.so.12 => not found 2025-05-07T20:11:09.5242721Z libc10.so => not found 2025-05-07T20:11:09.5242939Z libc10_cuda.so => not found 2025-05-07T20:11:09.5243192Z libtorch.so => not found 2025-05-07T20:11:09.5243425Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5243687Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5243937Z libcudart.so.12 => not found 2025-05-07T20:11:09.5244185Z libc10.so => not found 2025-05-07T20:11:09.5244696Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f7dd808c000) 2025-05-07T20:11:09.5245239Z libtorch.so => not found 2025-05-07T20:11:09.5245486Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5245736Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5246016Z libtorch.so => not found 2025-05-07T20:11:09.5246243Z libc10.so => not found 2025-05-07T20:11:09.5246482Z libc10_cuda.so => not found 2025-05-07T20:11:09.5246727Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5246983Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5247234Z libcudart.so.12 => not found 2025-05-07T20:11:09.5247481Z libc10.so => not found 2025-05-07T20:11:09.5247708Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5247966Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5248223Z libtorch.so => not found 2025-05-07T20:11:09.5248456Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5248703Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5248946Z libtorch.so => not found 2025-05-07T20:11:09.5249090Z 2025-05-07T20:11:09.5249204Z [CHECK] Displaying ELF information: 2025-05-07T20:11:09.5249658Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:09.5251394Z 2025-05-07T20:11:09.5251397Z 2025-05-07T20:11:09.5251555Z Dynamic section at offset 0x8dbfdd8 contains 39 entries: 2025-05-07T20:11:09.5251922Z Tag Type Name/Value 2025-05-07T20:11:09.5252305Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:09.5252787Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:09.5253316Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:09.5253866Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:09.5254341Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:09.5254874Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:09.5255372Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:09.5255861Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:09.5256354Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:09.5256833Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:09.5257321Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:09.5257883Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:11:09.5258433Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:09.5258814Z 0x000000000000000c (INIT) 0xbf000 2025-05-07T20:11:09.5259123Z 0x000000000000000d (FINI) 0x62dd0c 2025-05-07T20:11:09.5259453Z 0x0000000000000019 (INIT_ARRAY) 0x8dbf998 2025-05-07T20:11:09.5259776Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:11:09.5260115Z 0x000000000000001a (FINI_ARRAY) 0x8dbfa60 2025-05-07T20:11:09.5260429Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:09.5260748Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:09.5261070Z 0x000000006ffffef5 (GNU_HASH) 0x2b38 2025-05-07T20:11:09.5261373Z 0x0000000000000005 (STRTAB) 0xedf0 2025-05-07T20:11:09.5261680Z 0x0000000000000006 (SYMTAB) 0x5598 2025-05-07T20:11:09.5262007Z 0x000000000000000a (STRSZ) 594745 (bytes) 2025-05-07T20:11:09.5262350Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:09.5262671Z 0x0000000000000003 (PLTGOT) 0x8dc0088 2025-05-07T20:11:09.5263009Z 0x0000000000000002 (PLTRELSZ) 11400 (bytes) 2025-05-07T20:11:09.5263334Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:09.5263643Z 0x0000000000000017 (JMPREL) 0xbb9f8 2025-05-07T20:11:09.5263986Z 0x0000000000000007 (RELA) 0xa0f20 2025-05-07T20:11:09.5264307Z 0x0000000000000008 (RELASZ) 109272 (bytes) 2025-05-07T20:11:09.5264652Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:09.5264983Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:09.5265299Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:09.5265624Z 0x000000006ffffffe (VERNEED) 0xa0de0 2025-05-07T20:11:09.5265937Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:09.5266239Z 0x000000006ffffff0 (VERSYM) 0xa012a 2025-05-07T20:11:09.5266560Z 0x000000006ffffff9 (RELACOUNT) 3126 2025-05-07T20:11:09.5266859Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:09.5267046Z 2025-05-07T20:11:09.5267149Z ################################################################################ 2025-05-07T20:11:09.5267358Z 2025-05-07T20:11:09.5267371Z 2025-05-07T20:11:09.5267476Z ################################################################################ 2025-05-07T20:11:09.5267999Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5268551Z [CHECK] Listing out library size: 2025-05-07T20:11:09.5269152Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5269760Z 2025-05-07T20:11:09.5270014Z 59 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5270395Z 2025-05-07T20:11:09.5270855Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5271983Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.5272734Z 2025-05-07T20:11:09.5343679Z GLIBC_2.2.5 2025-05-07T20:11:09.5344051Z GLIBC_2.3 2025-05-07T20:11:09.5344919Z GLIBC_2.14 2025-05-07T20:11:09.5345089Z 2025-05-07T20:11:09.5345093Z 2025-05-07T20:11:09.5345594Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5346885Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.5347567Z 2025-05-07T20:11:09.5494109Z GLIBCXX_3.4 2025-05-07T20:11:09.5494866Z GLIBCXX_3.4.9 2025-05-07T20:11:09.5495674Z GLIBCXX_3.4.11 2025-05-07T20:11:09.5496252Z GLIBCXX_3.4.15 2025-05-07T20:11:09.5496828Z GLIBCXX_3.4.18 2025-05-07T20:11:09.5497388Z GLIBCXX_3.4.20 2025-05-07T20:11:09.5497974Z GLIBCXX_3.4.21 2025-05-07T20:11:09.5498548Z GLIBCXX_3.4.29 2025-05-07T20:11:09.5498914Z 2025-05-07T20:11:09.5498954Z 2025-05-07T20:11:09.5514679Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.LUJmNy35wS.symbols.txt 2025-05-07T20:11:09.5515684Z 2025-05-07T20:11:09.5631372Z 2025-05-07T20:11:09.5655999Z [CHECK] Total Number of symbols: 1791 2025-05-07T20:11:09.5672280Z [CHECK] Number of fbgemm symbols: 94 2025-05-07T20:11:09.5687546Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.xscNUVNWDi.usymbols.txt 2025-05-07T20:11:09.5688135Z 2025-05-07T20:11:09.5711291Z 2025-05-07T20:11:09.5737447Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:11:09.5758086Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5759377Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5760217Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:09.5760934Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.5761841Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.5762333Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.5762766Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:09.5763158Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:09.5763519Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:09.5763906Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.5764288Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:09.5764613Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:09.5764943Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:09.5765261Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:09.5765592Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:09.5765923Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:09.5766252Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:09.5766570Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:09.5766891Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:09.5767264Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:09.5767575Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:09.5767899Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:09.5768215Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:09.5768535Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:09.5768900Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:09.5769331Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:09.5769751Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:09.5770090Z U at::RecordFunction::end() 2025-05-07T20:11:09.5770478Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:09.5770858Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:09.5771313Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:09.5771783Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:09.5772684Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5774113Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5775071Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:09.5775800Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5776930Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.5777727Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:09.5778080Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:09.5778453Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:09.5778798Z U at::globalContext() 2025-05-07T20:11:09.5779113Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:09.5779430Z U c10::AnyType::get() 2025-05-07T20:11:09.5779806Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5780212Z U c10::BoolType::get() 2025-05-07T20:11:09.5780571Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:09.5781008Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:09.5781429Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:09.5782148Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:09.5783359Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:09.5784826Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:09.5785676Z U c10::Error::what() const 2025-05-07T20:11:09.5786006Z U c10::FloatType::get() 2025-05-07T20:11:09.5786323Z U c10::GradMode::is_enabled() 2025-05-07T20:11:09.5786659Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:09.5787122Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5787582Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:09.5787979Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:09.5788312Z U c10::IValue::isBoolList() const 2025-05-07T20:11:09.5788654Z U c10::IValue::isIntList() const 2025-05-07T20:11:09.5789083Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:09.5789430Z U c10::IValue::isTensorList() const 2025-05-07T20:11:09.5789792Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:09.5790156Z U c10::IntType::get() 2025-05-07T20:11:09.5790585Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:09.5790995Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:09.5791364Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:09.5791727Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:09.5792197Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:09.5792837Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:09.5793400Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.5793783Z U c10::StringType::get() 2025-05-07T20:11:09.5794140Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:09.5794556Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:09.5794984Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:09.5795439Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:09.5795869Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:09.5796556Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:09.5797240Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:09.5797613Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:09.5798001Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:09.5798386Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:09.5798736Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:09.5799118Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:09.5799486Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:09.5799887Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:09.5800215Z U c10::SymIntType::get() 2025-05-07T20:11:09.5800569Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:09.5801021Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:09.5801411Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.5801889Z U c10::TensorType::get() 2025-05-07T20:11:09.5802196Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:09.5803115Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:09.5804057Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:09.5804394Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:09.5804731Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:09.5805052Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:09.5805417Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:09.5805764Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:09.5806208Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:09.5806663Z U c10::cuda::device_count() 2025-05-07T20:11:09.5806991Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:09.5807367Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:09.5807751Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:09.5808123Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:09.5808608Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:09.5808973Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:09.5809615Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:09.5810655Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:09.5811500Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:09.5812351Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.5813449Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:09.5814682Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.5815704Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:09.5816327Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:09.5817097Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:09.5818186Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:09.5818646Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:09.5819096Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:09.5819543Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:09.5820268Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:09.5820706Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:09.5821397Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:09.5822065Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:09.5822445Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:09.5822852Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:09.5823282Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:09.5823709Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:09.5824130Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:09.5824505Z U c10::throwNullDataPtrError() 2025-05-07T20:11:09.5824860Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:09.5825211Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:09.5825634Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:09.5826080Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:09.5826477Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:09.5826863Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:09.5827242Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:09.5827632Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:09.5828003Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:09.5828356Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:09.5828708Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:09.5829163Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:09.5829545Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:09.5830048Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:09.5830439Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:09.5830793Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:09.5831167Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:09.5831533Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:09.5831890Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:09.5832271Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:09.5834820Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:09.5837508Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:09.5838041Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.5838471Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5838855Z U free@GLIBC_2.2.5 2025-05-07T20:11:09.5839201Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.5839610Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5840049Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:09.5840495Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.5840896Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.5841396Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:09.5841702Z U memcpy@GLIBC_2.14 2025-05-07T20:11:09.5841984Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:09.5842278Z U memset@GLIBC_2.2.5 2025-05-07T20:11:09.5842634Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:09.5843024Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:09.5843555Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.5844310Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.5844829Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:09.5845237Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:09.5845896Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:09.5846707Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:09.5847561Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:09.5848323Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:09.5849114Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:09.5849958Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:09.5850740Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:09.5851111Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:09.5851664Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.5852062Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.5852721Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:09.5853541Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:09.5854257Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:09.5854877Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:09.5855610Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:09.5856706Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.5857981Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.5858762Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:09.5859140Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:09.5859500Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:09.5873274Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.5873929Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.5874280Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:09.5874629Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:09.5875041Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.5875704Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.5876199Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:09.5876661Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:09.5877090Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:09.5877786Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:09.5878485Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:09.5878842Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:09.5879164Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:09.5879459Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:09.5879777Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:09.5880642Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:09.5883621Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.5884711Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.5885229Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:09.5885772Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:09.5886376Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:09.5886894Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:09.5887504Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:09.5888178Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:09.5888800Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:09.5889266Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:09.5889765Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:09.5890178Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:09.5890538Z U torch::autograd::Node::metadata() 2025-05-07T20:11:09.5890894Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:09.5891396Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:09.5892046Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:09.5892586Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:09.5893063Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:09.5893620Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:09.5896845Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:09.5900067Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:09.5900469Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:09.5900871Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:09.5901908Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:09.5902926Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:09.5903565Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:09.5904418Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:09.5905447Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:09.5906171Z U typeinfo for c10::Error 2025-05-07T20:11:09.5906506Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:09.5906869Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:09.5907204Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:09.5907565Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:09.5907895Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:09.5909811Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5912913Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5915949Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5918931Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5921995Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5924905Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:09.5926459Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:09.5926868Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:09.5927283Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:09.5927680Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:09.5928032Z U vtable for c10::Error 2025-05-07T20:11:09.5928556Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5929319Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5930091Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.5930646Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:09.5931063Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:09.5931564Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:09.5931993Z U vtable for torch::autograd::Node 2025-05-07T20:11:09.5932375Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:09.5932745Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:09.5933071Z w _ITM_registerTMCloneTable 2025-05-07T20:11:09.5933362Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:09.5933648Z w __gmon_start__ 2025-05-07T20:11:09.5933902Z w __pthread_key_create 2025-05-07T20:11:09.5934198Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:09.5934510Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:09.5934847Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:09.5935337Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5935697Z 2025-05-07T20:11:09.5935856Z linux-vdso.so.1 (0x00007ffd6d4ae000) 2025-05-07T20:11:09.5936120Z libc10.so => not found 2025-05-07T20:11:09.5936355Z libc10_cuda.so => not found 2025-05-07T20:11:09.5936903Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f1311e00000) 2025-05-07T20:11:09.5936990Z libtorch.so => not found 2025-05-07T20:11:09.5937092Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5937180Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5937266Z libcudart.so.12 => not found 2025-05-07T20:11:09.5937420Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1311b9c000) 2025-05-07T20:11:09.5937572Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f134fc34000) 2025-05-07T20:11:09.5937685Z libc.so.6 => /lib64/libc.so.6 (0x00007f1311994000) 2025-05-07T20:11:09.5937802Z /lib64/ld-linux-x86-64.so.2 (0x00007f134fc68000) 2025-05-07T20:11:09.5937896Z libc10.so => not found 2025-05-07T20:11:09.5937980Z libc10_cuda.so => not found 2025-05-07T20:11:09.5938428Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f1311600000) 2025-05-07T20:11:09.5938962Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f1311450000) 2025-05-07T20:11:09.5939046Z libtorch.so => not found 2025-05-07T20:11:09.5939408Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f1310e00000) 2025-05-07T20:11:09.5939884Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f130fc00000) 2025-05-07T20:11:09.5939972Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5940059Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5940148Z libcudart.so.12 => not found 2025-05-07T20:11:09.5940276Z libm.so.6 => /lib64/libm.so.6 (0x00007f1311375000) 2025-05-07T20:11:09.5940355Z libc10.so => not found 2025-05-07T20:11:09.5940440Z libc10_cuda.so => not found 2025-05-07T20:11:09.5940872Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f134fc22000) 2025-05-07T20:11:09.5940957Z libtorch.so => not found 2025-05-07T20:11:09.5941045Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5941148Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5941240Z libcudart.so.12 => not found 2025-05-07T20:11:09.5941318Z libc10.so => not found 2025-05-07T20:11:09.5941444Z libc10_cuda.so => not found 2025-05-07T20:11:09.5941543Z libtorch.so => not found 2025-05-07T20:11:09.5941626Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5941713Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5941809Z libcudart.so.12 => not found 2025-05-07T20:11:09.5941893Z libc10.so => not found 2025-05-07T20:11:09.5942230Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f134bf85000) 2025-05-07T20:11:09.5942311Z libtorch.so => not found 2025-05-07T20:11:09.5942412Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5942498Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5942584Z libtorch.so => not found 2025-05-07T20:11:09.5942677Z libc10.so => not found 2025-05-07T20:11:09.5942789Z libc10_cuda.so => not found 2025-05-07T20:11:09.5942877Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5942967Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5943065Z libcudart.so.12 => not found 2025-05-07T20:11:09.5943143Z libc10.so => not found 2025-05-07T20:11:09.5943227Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5943327Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5943414Z libtorch.so => not found 2025-05-07T20:11:09.5943499Z libtorch_cpu.so => not found 2025-05-07T20:11:09.5943585Z libtorch_cuda.so => not found 2025-05-07T20:11:09.5943680Z libtorch.so => not found 2025-05-07T20:11:09.5943685Z 2025-05-07T20:11:09.5943786Z [CHECK] Displaying ELF information: 2025-05-07T20:11:09.5944067Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:09.5944072Z 2025-05-07T20:11:09.5944075Z 2025-05-07T20:11:09.5944239Z Dynamic section at offset 0x3a22e50 contains 39 entries: 2025-05-07T20:11:09.5944351Z Tag Type Name/Value 2025-05-07T20:11:09.5944533Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:09.5944732Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:09.5944980Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:09.5945163Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:09.5945361Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:09.5945551Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:09.5945737Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:09.5945924Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:09.5946113Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:09.5946293Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:09.5946514Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:09.5946791Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:11:09.5946989Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:09.5947094Z 0x000000000000000c (INIT) 0x7a000 2025-05-07T20:11:09.5947206Z 0x000000000000000d (FINI) 0x26a70c 2025-05-07T20:11:09.5947317Z 0x0000000000000019 (INIT_ARRAY) 0x3a23350 2025-05-07T20:11:09.5947435Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:11:09.5947554Z 0x000000000000001a (FINI_ARRAY) 0x3a23408 2025-05-07T20:11:09.5947664Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:09.5947766Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:09.5947870Z 0x000000006ffffef5 (GNU_HASH) 0x2e00 2025-05-07T20:11:09.5947982Z 0x0000000000000005 (STRTAB) 0x101c8 2025-05-07T20:11:09.5948085Z 0x0000000000000006 (SYMTAB) 0x59c8 2025-05-07T20:11:09.5948212Z 0x000000000000000a (STRSZ) 353759 (bytes) 2025-05-07T20:11:09.5948353Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:09.5948462Z 0x0000000000000003 (PLTGOT) 0x3a24100 2025-05-07T20:11:09.5948585Z 0x0000000000000002 (PLTRELSZ) 13056 (bytes) 2025-05-07T20:11:09.5948685Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:09.5948796Z 0x0000000000000017 (JMPREL) 0x75e68 2025-05-07T20:11:09.5948896Z 0x0000000000000007 (RELA) 0x67708 2025-05-07T20:11:09.5949102Z 0x0000000000000008 (RELASZ) 59232 (bytes) 2025-05-07T20:11:09.5949226Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:09.5949317Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:09.5949614Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:09.5949770Z 0x000000006ffffffe (VERNEED) 0x675a8 2025-05-07T20:11:09.5949880Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:09.5949994Z 0x000000006ffffff0 (VERSYM) 0x667a8 2025-05-07T20:11:09.5950144Z 0x000000006ffffff9 (RELACOUNT) 1167 2025-05-07T20:11:09.5950252Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:09.5950257Z 2025-05-07T20:11:09.5950371Z ################################################################################ 2025-05-07T20:11:09.5950375Z 2025-05-07T20:11:09.5950379Z 2025-05-07T20:11:09.5950490Z ################################################################################ 2025-05-07T20:11:09.5950828Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.5950931Z [CHECK] Listing out library size: 2025-05-07T20:11:09.5951252Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.5951258Z 2025-05-07T20:11:09.5951523Z 329 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.5951527Z 2025-05-07T20:11:09.5951978Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.5952530Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.5952548Z 2025-05-07T20:11:09.6466107Z GLIBC_2.2.5 2025-05-07T20:11:09.6466271Z GLIBC_2.3 2025-05-07T20:11:09.6466411Z GLIBC_2.14 2025-05-07T20:11:09.6466465Z 2025-05-07T20:11:09.6466472Z 2025-05-07T20:11:09.6467317Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.6467904Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:09.6467923Z 2025-05-07T20:11:09.7059152Z GLIBCXX_3.4 2025-05-07T20:11:09.7059458Z GLIBCXX_3.4.9 2025-05-07T20:11:09.7059723Z GLIBCXX_3.4.11 2025-05-07T20:11:09.7059980Z GLIBCXX_3.4.18 2025-05-07T20:11:09.7060211Z GLIBCXX_3.4.20 2025-05-07T20:11:09.7060544Z GLIBCXX_3.4.21 2025-05-07T20:11:09.7060774Z GLIBCXX_3.4.29 2025-05-07T20:11:09.7060793Z 2025-05-07T20:11:09.7060806Z 2025-05-07T20:11:09.7086570Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.TUk5EDLUkz.symbols.txt 2025-05-07T20:11:09.7086588Z 2025-05-07T20:11:09.7651058Z 2025-05-07T20:11:09.7688595Z [CHECK] Total Number of symbols: 3670 2025-05-07T20:11:09.7742268Z [CHECK] Number of fbgemm symbols: 456 2025-05-07T20:11:09.7758432Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.N76ib2jNTL.usymbols.txt 2025-05-07T20:11:09.7758484Z 2025-05-07T20:11:09.7788303Z 2025-05-07T20:11:09.7818053Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:11:09.7832446Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.7834116Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.7834224Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:09.7834372Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.7834529Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:09.7834662Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.7834863Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:09.7834992Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:09.7835115Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:09.7835334Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:09.7835439Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:09.7835547Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:09.7835663Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:09.7835763Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:09.7835876Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:09.7836003Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:09.7836102Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:09.7836213Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:09.7836452Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:09.7836641Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:09.7836779Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:09.7837074Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:09.7837238Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:09.7837805Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.7838437Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.7838619Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:09.7838914Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:09.7839383Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.7840018Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:09.7840176Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:09.7840316Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:09.7840412Z U at::globalContext() 2025-05-07T20:11:09.7840607Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.7840706Z U c10::BoolType::get() 2025-05-07T20:11:09.7840872Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:09.7840970Z U c10::FloatType::get() 2025-05-07T20:11:09.7841084Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:09.7841254Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.7841386Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:09.7841501Z U c10::IntType::get() 2025-05-07T20:11:09.7841665Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:09.7841783Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:09.7841930Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.7842067Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:09.7842207Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:09.7842368Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:09.7842507Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:09.7842925Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:09.7843053Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:09.7843190Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:09.7843312Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:09.7843431Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:09.7843552Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:09.7843684Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:09.7843784Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:09.7843881Z U c10::SymIntType::get() 2025-05-07T20:11:09.7844032Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:09.7844175Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:09.7844269Z U c10::TensorType::get() 2025-05-07T20:11:09.7844396Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:09.7845081Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:09.7845209Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:09.7845332Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:09.7845443Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:09.7845551Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:09.7845674Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:09.7845779Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:09.7846013Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:09.7846140Z U c10::cuda::device_count() 2025-05-07T20:11:09.7846272Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:09.7846419Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:09.7846549Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:09.7846688Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:09.7846834Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:09.7846938Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:09.7847444Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:09.7847678Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:09.7848162Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.7848509Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:09.7849068Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:09.7849186Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:09.7849286Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:09.7849427Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:09.7849597Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:09.7849727Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:09.7849846Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:09.7849970Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:09.7850107Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:09.7850241Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:09.7850376Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:09.7850494Z U c10::throwNullDataPtrError() 2025-05-07T20:11:09.7850597Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:09.7850710Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:09.7850905Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:09.7851025Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:09.7851153Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:09.7851281Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:09.7851410Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:09.7851522Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:09.7851655Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:09.7851761Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:09.7851870Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:09.7851989Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:09.7852110Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:09.7852243Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:09.7852357Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:09.7852473Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:09.7852578Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:09.7852687Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:09.7852833Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:09.7852947Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:09.7855091Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:09.7855291Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:09.7855407Z U float at::Tensor::item() const 2025-05-07T20:11:09.7855540Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.7855699Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.7855833Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.7855969Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.7856147Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:09.7856273Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:09.7856414Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:09.7856515Z U memcpy@GLIBC_2.14 2025-05-07T20:11:09.7856608Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:09.7856701Z U memset@GLIBC_2.2.5 2025-05-07T20:11:09.7856853Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:09.7856988Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:09.7857300Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.7857607Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.7857903Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.7858216Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.7858524Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.7858827Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:09.7859156Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:09.7859538Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:09.7859865Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:09.7860222Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:09.7860337Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:09.7860448Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:09.7860581Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.7860718Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.7860881Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:09.7861041Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:09.7861302Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:09.7861634Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:09.7862192Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.7862691Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:09.7862807Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:09.7862924Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:09.7863039Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.7863148Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:09.7863277Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:09.7863395Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:09.7863566Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.7863794Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:09.7863921Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:09.7864019Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:09.7864113Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:09.7864238Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:09.7864827Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:09.7865277Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.7865532Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:09.7865875Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:09.7866419Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:09.7868329Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.7870592Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.7872655Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.7874681Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.7876688Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.7878538Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:09.7880234Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:09.7880383Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:09.7880550Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:09.7880694Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:09.7881031Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.7881357Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.7881686Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:09.7881886Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:09.7882097Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:09.7882202Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:09.7882302Z w _ITM_registerTMCloneTable 2025-05-07T20:11:09.7882409Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:09.7882496Z w __gmon_start__ 2025-05-07T20:11:09.7882595Z w __pthread_key_create 2025-05-07T20:11:09.7883289Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:09.7883566Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:09.7883715Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:09.7884005Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.7884013Z 2025-05-07T20:11:09.7885981Z linux-vdso.so.1 (0x00007ffe901ed000) 2025-05-07T20:11:09.7886079Z libc10.so => not found 2025-05-07T20:11:09.7886182Z libc10_cuda.so => not found 2025-05-07T20:11:09.7886782Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f2a6b200000) 2025-05-07T20:11:09.7886878Z libtorch.so => not found 2025-05-07T20:11:09.7886976Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7887079Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7887177Z libcudart.so.12 => not found 2025-05-07T20:11:09.7887345Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2a6af9c000) 2025-05-07T20:11:09.7887506Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2aba42f000) 2025-05-07T20:11:09.7887699Z libc.so.6 => /lib64/libc.so.6 (0x00007f2a6ad94000) 2025-05-07T20:11:09.7887831Z /lib64/ld-linux-x86-64.so.2 (0x00007f2aba463000) 2025-05-07T20:11:09.7887927Z libc10.so => not found 2025-05-07T20:11:09.7888021Z libc10_cuda.so => not found 2025-05-07T20:11:09.7888507Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f2a6aa00000) 2025-05-07T20:11:09.7889068Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f2a6a850000) 2025-05-07T20:11:09.7889169Z libtorch.so => not found 2025-05-07T20:11:09.7889532Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f2a6a200000) 2025-05-07T20:11:09.7890080Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2a69000000) 2025-05-07T20:11:09.7890177Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7890276Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7890384Z libcudart.so.12 => not found 2025-05-07T20:11:09.7890508Z libm.so.6 => /lib64/libm.so.6 (0x00007f2a6a775000) 2025-05-07T20:11:09.7890596Z libc10.so => not found 2025-05-07T20:11:09.7890693Z libc10_cuda.so => not found 2025-05-07T20:11:09.7891167Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f2aba41d000) 2025-05-07T20:11:09.7891263Z libtorch.so => not found 2025-05-07T20:11:09.7891361Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7891478Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7891573Z libcudart.so.12 => not found 2025-05-07T20:11:09.7891665Z libc10.so => not found 2025-05-07T20:11:09.7891761Z libc10_cuda.so => not found 2025-05-07T20:11:09.7891875Z libtorch.so => not found 2025-05-07T20:11:09.7891968Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7892065Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7892174Z libcudart.so.12 => not found 2025-05-07T20:11:09.7892260Z libc10.so => not found 2025-05-07T20:11:09.7892624Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f2aa5385000) 2025-05-07T20:11:09.7892719Z libtorch.so => not found 2025-05-07T20:11:09.7892830Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7892926Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7893018Z libtorch.so => not found 2025-05-07T20:11:09.7893247Z libc10.so => not found 2025-05-07T20:11:09.7893337Z libc10_cuda.so => not found 2025-05-07T20:11:09.7893429Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7893629Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7893737Z libcudart.so.12 => not found 2025-05-07T20:11:09.7893822Z libc10.so => not found 2025-05-07T20:11:09.7893905Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7894047Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7894134Z libtorch.so => not found 2025-05-07T20:11:09.7894222Z libtorch_cpu.so => not found 2025-05-07T20:11:09.7894345Z libtorch_cuda.so => not found 2025-05-07T20:11:09.7894444Z libtorch.so => not found 2025-05-07T20:11:09.7894449Z 2025-05-07T20:11:09.7894551Z [CHECK] Displaying ELF information: 2025-05-07T20:11:09.7894824Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:09.7894829Z 2025-05-07T20:11:09.7932134Z 2025-05-07T20:11:09.7932985Z Dynamic section at offset 0x148571f8 contains 39 entries: 2025-05-07T20:11:09.7933343Z Tag Type Name/Value 2025-05-07T20:11:09.7933941Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:09.7934582Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:09.7934860Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:09.7935072Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:09.7935391Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:09.7935596Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:09.7935801Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:09.7936019Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:09.7936218Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:09.7936412Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:09.7936639Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:09.7936982Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:11:09.7937170Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:09.7937300Z 0x000000000000000c (INIT) 0x1c3000 2025-05-07T20:11:09.7937416Z 0x000000000000000d (FINI) 0xf0879c 2025-05-07T20:11:09.7937644Z 0x0000000000000019 (INIT_ARRAY) 0x14856518 2025-05-07T20:11:09.7937769Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:11:09.7937895Z 0x000000000000001a (FINI_ARRAY) 0x148567c0 2025-05-07T20:11:09.7938011Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:09.7938116Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:09.7938241Z 0x000000006ffffef5 (GNU_HASH) 0x4b88 2025-05-07T20:11:09.7938348Z 0x0000000000000005 (STRTAB) 0x1fa30 2025-05-07T20:11:09.7938451Z 0x0000000000000006 (SYMTAB) 0xa208 2025-05-07T20:11:09.7938598Z 0x000000000000000a (STRSZ) 1419969 (bytes) 2025-05-07T20:11:09.7938711Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:09.7938833Z 0x0000000000000003 (PLTGOT) 0x148574a8 2025-05-07T20:11:09.7938963Z 0x0000000000000002 (PLTRELSZ) 18120 (bytes) 2025-05-07T20:11:09.7939079Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:09.7939188Z 0x0000000000000017 (JMPREL) 0x1bded8 2025-05-07T20:11:09.7939294Z 0x0000000000000007 (RELA) 0x17c2e0 2025-05-07T20:11:09.7939434Z 0x0000000000000008 (RELASZ) 269304 (bytes) 2025-05-07T20:11:09.7939549Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:09.7939643Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:09.7939774Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:09.7939887Z 0x000000006ffffffe (VERNEED) 0x17c1a0 2025-05-07T20:11:09.7939994Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:09.7940109Z 0x000000006ffffff0 (VERSYM) 0x17a4f2 2025-05-07T20:11:09.7940228Z 0x000000006ffffff9 (RELACOUNT) 7406 2025-05-07T20:11:09.7940362Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:09.7940369Z 2025-05-07T20:11:09.7940495Z ################################################################################ 2025-05-07T20:11:09.7940534Z 2025-05-07T20:11:09.7940539Z 2025-05-07T20:11:09.7940760Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:11:09.8052857Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8075375Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8306719Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8339851Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8387654Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8426774Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8466380Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8497093Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:09.8610926Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.8642335Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.8867647Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.8906733Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.8957551Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.8989904Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.9025196Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.9061198Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.9460535Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:09.9821167Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:10.0013126Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:10.0946462Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:10.0979024Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:10.1071251Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:10.1384290Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:10.1385390Z ################################################################################ 2025-05-07T20:11:10.1385959Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:10.1386449Z 2025-05-07T20:11:10.1386953Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:10.1387541Z 2025-05-07T20:11:21.6272585Z 2025-05-07T20:11:21.6273271Z fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:21.6273860Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:21.6274202Z 2025-05-07T20:11:21.6274393Z The wheel references external versioned symbols in these 2025-05-07T20:11:21.6274863Z system-provided shared libraries: libgcc_s.so.1 with versions 2025-05-07T20:11:21.6275483Z {'GCC_3.0', 'GCC_3.4'}, libstdc++.so.6 with versions {'CXXABI_1.3.8', 2025-05-07T20:11:21.6275959Z 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.9', 'CXXABI_1.3', 'GLIBCXX_3.4.19', 2025-05-07T20:11:21.6276431Z 'GLIBCXX_3.4.11', 'CXXABI_1.3.7', 'CXXABI_1.3.5', 'GLIBCXX_3.4', 2025-05-07T20:11:21.6276901Z 'GLIBCXX_3.4.14', 'GLIBCXX_3.4.18', 'CXXABI_1.3.11', 'CXXABI_1.3.9', 2025-05-07T20:11:21.6277713Z 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.29', 'CXXABI_1.3.3', 'GLIBCXX_3.4.20'}, 2025-05-07T20:11:21.6278187Z libc.so.6 with versions {'GLIBC_2.2.5', 'GLIBC_2.14'}, libm.so.6 with 2025-05-07T20:11:21.6278627Z versions {'GLIBC_2.2.5'}, libcudart.so.12 with versions 2025-05-07T20:11:21.6278976Z {'libcudart.so.12'} 2025-05-07T20:11:21.6279115Z 2025-05-07T20:11:21.6279313Z This constrains the platform tag to "manylinux_2_34_x86_64". In order 2025-05-07T20:11:21.6279824Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:21.6280314Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:21.6280783Z libraries, such as a recent manylinux image. 2025-05-07T20:11:21.7108648Z 2025-05-07T20:11:21.7108819Z 2025-05-07T20:11:21.7109547Z ################################################################################ 2025-05-07T20:11:21.7110661Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:21.7112091Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:21.7113157Z 2025-05-07T20:11:21.7128540Z -rw-r--r--. 1 root root 511M May 7 20:11 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:21.7129946Z 2025-05-07T20:11:21.7130300Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:21.7131692Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:21.7132784Z 2025-05-07T20:11:22.6310307Z e558208d4c09c2a8f544c29f24cb7c2aac9f31a0 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:22.6311964Z 2025-05-07T20:11:22.6312773Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:22.6313924Z 2025-05-07T20:11:24.8249168Z e6091571777d9754f2d0c431d6b221cc6217b3ec1a52a71fce44a22bbcb221c9 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:24.8251211Z 2025-05-07T20:11:24.8251959Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:24.8253042Z 2025-05-07T20:11:25.6402170Z 86540de58e444d24256e6d3c614426c9 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:11:25.6403013Z 2025-05-07T20:11:25.6403151Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:25.6521383Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:11:25.6521854Z with: 2025-05-07T20:11:25.6522151Z name: fbgemm_default_x86_gcc_py3.11_cu12.6.3.whl 2025-05-07T20:11:25.6522517Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:25.6522852Z if-no-files-found: error 2025-05-07T20:11:25.6523135Z compression-level: 6 2025-05-07T20:11:25.6523438Z overwrite: false 2025-05-07T20:11:25.6523723Z include-hidden-files: false 2025-05-07T20:11:25.6524002Z env: 2025-05-07T20:11:25.6524268Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:25.6524598Z BUILD_ENV: build_binary 2025-05-07T20:11:25.6524907Z BUILD_TARGET: default 2025-05-07T20:11:25.6525162Z BUILD_VARIANT: cuda 2025-05-07T20:11:25.6525447Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:25.6525717Z ##[endgroup] 2025-05-07T20:11:25.6529516Z ##[command]/usr/bin/docker exec a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:26.0954410Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:26.0958156Z Artifact name is valid! 2025-05-07T20:11:26.0958898Z Root directory input is valid! 2025-05-07T20:11:26.1977405Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:27.0210930Z Uploaded bytes 8388608 2025-05-07T20:11:27.3236373Z Uploaded bytes 16777216 2025-05-07T20:11:27.7905858Z Uploaded bytes 25165824 2025-05-07T20:11:28.2351306Z Uploaded bytes 33554432 2025-05-07T20:11:28.6746843Z Uploaded bytes 41943040 2025-05-07T20:11:29.1532479Z Uploaded bytes 50331648 2025-05-07T20:11:29.5709629Z Uploaded bytes 58720256 2025-05-07T20:11:30.0051609Z Uploaded bytes 67108864 2025-05-07T20:11:30.4397436Z Uploaded bytes 75497472 2025-05-07T20:11:30.8617964Z Uploaded bytes 83886080 2025-05-07T20:11:31.3794419Z Uploaded bytes 92274688 2025-05-07T20:11:31.7719254Z Uploaded bytes 100663296 2025-05-07T20:11:32.2154457Z Uploaded bytes 109051904 2025-05-07T20:11:32.6746544Z Uploaded bytes 117440512 2025-05-07T20:11:33.0860569Z Uploaded bytes 125829120 2025-05-07T20:11:33.5348004Z Uploaded bytes 134217728 2025-05-07T20:11:33.9441508Z Uploaded bytes 142606336 2025-05-07T20:11:34.3865589Z Uploaded bytes 150994944 2025-05-07T20:11:34.8350778Z Uploaded bytes 159383552 2025-05-07T20:11:35.2492145Z Uploaded bytes 167772160 2025-05-07T20:11:35.6825172Z Uploaded bytes 176160768 2025-05-07T20:11:36.1211154Z Uploaded bytes 184549376 2025-05-07T20:11:36.6031444Z Uploaded bytes 192937984 2025-05-07T20:11:36.9785111Z Uploaded bytes 201326592 2025-05-07T20:11:37.4747649Z Uploaded bytes 209715200 2025-05-07T20:11:37.9271446Z Uploaded bytes 218103808 2025-05-07T20:11:38.2966187Z Uploaded bytes 226492416 2025-05-07T20:11:38.7392359Z Uploaded bytes 234881024 2025-05-07T20:11:39.1626407Z Uploaded bytes 243269632 2025-05-07T20:11:39.6240324Z Uploaded bytes 251658240 2025-05-07T20:11:40.0016303Z Uploaded bytes 260046848 2025-05-07T20:11:40.4072598Z Uploaded bytes 268435456 2025-05-07T20:11:40.8343246Z Uploaded bytes 276824064 2025-05-07T20:11:41.2191039Z Uploaded bytes 285212672 2025-05-07T20:11:41.7014334Z Uploaded bytes 293601280 2025-05-07T20:11:42.0695451Z Uploaded bytes 301989888 2025-05-07T20:11:42.5112242Z Uploaded bytes 310378496 2025-05-07T20:11:42.9889626Z Uploaded bytes 318767104 2025-05-07T20:11:43.4597601Z Uploaded bytes 327155712 2025-05-07T20:11:43.9105282Z Uploaded bytes 335544320 2025-05-07T20:11:44.4209878Z Uploaded bytes 343932928 2025-05-07T20:11:44.7666647Z Uploaded bytes 352321536 2025-05-07T20:11:45.2449073Z Uploaded bytes 360710144 2025-05-07T20:11:45.6215246Z Uploaded bytes 369098752 2025-05-07T20:11:46.0097239Z Uploaded bytes 377487360 2025-05-07T20:11:46.5206178Z Uploaded bytes 385875968 2025-05-07T20:11:46.9348570Z Uploaded bytes 394264576 2025-05-07T20:11:47.4019490Z Uploaded bytes 402653184 2025-05-07T20:11:47.8232192Z Uploaded bytes 411041792 2025-05-07T20:11:48.2178787Z Uploaded bytes 419430400 2025-05-07T20:11:48.6753017Z Uploaded bytes 427819008 2025-05-07T20:11:49.1634089Z Uploaded bytes 436207616 2025-05-07T20:11:49.4919981Z Uploaded bytes 444596224 2025-05-07T20:11:50.0680951Z Uploaded bytes 452984832 2025-05-07T20:11:50.3985771Z Uploaded bytes 461373440 2025-05-07T20:11:50.8272052Z Uploaded bytes 469762048 2025-05-07T20:11:51.2554510Z Uploaded bytes 478150656 2025-05-07T20:11:51.7068052Z Uploaded bytes 486539264 2025-05-07T20:11:52.0832908Z Uploaded bytes 494927872 2025-05-07T20:11:52.5042942Z Uploaded bytes 503316480 2025-05-07T20:11:52.9118982Z Uploaded bytes 511705088 2025-05-07T20:11:53.3463433Z Uploaded bytes 520093696 2025-05-07T20:11:53.5432045Z Uploaded bytes 524579930 2025-05-07T20:11:53.5648226Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:53.5649032Z SHA256 digest of uploaded artifact zip is 5b1fa6bb606af48c0159ab67c5cdf7cde91ec78c92ec78ba89e25d1887664e25 2025-05-07T20:11:53.5649779Z Finalizing artifact upload 2025-05-07T20:11:53.6576734Z Artifact fbgemm_default_x86_gcc_py3.11_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081459851 2025-05-07T20:11:53.6577722Z Artifact fbgemm_default_x86_gcc_py3.11_cu12.6.3.whl has been successfully uploaded! Final size is 524579930 bytes. Artifact ID is 3081459851 2025-05-07T20:11:53.6586995Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081459851 2025-05-07T20:11:53.6828221Z Post job cleanup. 2025-05-07T20:11:53.6834227Z ##[command]/usr/bin/docker exec a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:53.9871342Z [command]/usr/bin/git version 2025-05-07T20:11:53.9907966Z git version 2.47.1 2025-05-07T20:11:53.9938554Z Copying '/github/home/.gitconfig' to '/__w/_temp/440469c1-4298-4dfb-b932-aafb180997a6/.gitconfig' 2025-05-07T20:11:53.9947193Z Temporarily overriding HOME='/__w/_temp/440469c1-4298-4dfb-b932-aafb180997a6' before making global git config changes 2025-05-07T20:11:53.9951080Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:53.9953110Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:53.9998204Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:54.0024874Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:54.0295472Z Entering 'external/asmjit' 2025-05-07T20:11:54.0349600Z Entering 'external/composable_kernel' 2025-05-07T20:11:54.0405832Z Entering 'external/cpuinfo' 2025-05-07T20:11:54.0471266Z Entering 'external/cutlass' 2025-05-07T20:11:54.0546404Z Entering 'external/googletest' 2025-05-07T20:11:54.0613401Z Entering 'external/hipify_torch' 2025-05-07T20:11:54.0661243Z Entering 'external/json' 2025-05-07T20:11:54.0718990Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:54.0740071Z http.https://github.com/.extraheader 2025-05-07T20:11:54.0744792Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:11:54.0770399Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:54.1044588Z Entering 'external/asmjit' 2025-05-07T20:11:54.1088005Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1118253Z Entering 'external/composable_kernel' 2025-05-07T20:11:54.1160722Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1200738Z Entering 'external/cpuinfo' 2025-05-07T20:11:54.1244473Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1285925Z Entering 'external/cutlass' 2025-05-07T20:11:54.1315239Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1370229Z Entering 'external/googletest' 2025-05-07T20:11:54.1399361Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1435728Z Entering 'external/hipify_torch' 2025-05-07T20:11:54.1480535Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1512270Z Entering 'external/json' 2025-05-07T20:11:54.1546798Z http.https://github.com/.extraheader 2025-05-07T20:11:54.1698433Z Stop and remove container: be310ab4d61e4fc4b18391c314211a7d_amazonlinux2023_40b1c5 2025-05-07T20:11:54.1703387Z ##[command]/usr/bin/docker rm --force a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 2025-05-07T20:11:54.9904672Z a4cdfef5f67791de1447d833ba4c241d003473f6a3bbff3a88701c507bc3b898 2025-05-07T20:11:54.9930328Z Remove container network: github_network_d1c31e94b16749b88a8f16884e015518 2025-05-07T20:11:54.9934740Z ##[command]/usr/bin/docker network rm github_network_d1c31e94b16749b88a8f16884e015518 2025-05-07T20:11:56.0644054Z github_network_d1c31e94b16749b88a8f16884e015518 2025-05-07T20:11:56.0673913Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:56.0693505Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:11:56.0699377Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:56.0699766Z ##[endgroup] 2025-05-07T20:11:56.0805011Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:12:06.1968749Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:12:22.2356725Z Cleaning up orphan processes